(54) Title: METHODS AND MEANS OF HISTONE METHYLATION

(57) Abstract: This invention relates to the methylation of histones, in particular to a previously uncharacterised group of histone
H3 methylases which comprise a SET domain and which methylate either lysine 4 within the amino tail of histone H3 or within the
histone H3 core. Methylation by these methylases, in particular trimethylation, is shown to be important for transcriptional activity.

Methods and Means of Histone Methylation

This invention relates to histone methylation, in
particular to the methylation of specific lysine
residues on histone H3 and to the biological effects of
such methylation.

DNA in the eukaryotic nucleus is wrapped around a
histone core, which is a protein complex involving the
four histones H4, H3, H2B and H2A. This DNA-histone
structure (nucleosome) is not compatible with gene
expression. Re-organisation of the nucleosome is
required for transcription factors and RNA polymerase to
have access to the DNA for transcription.

Covalent post-translational modifications of the amino-terminal
tails of histones regulate the transcriptional
'on' or 'off' states of chromatin and influence
chromosome condensation and segregation.

Such modifications include acetylation, phosphorylation
and methylation.

Suv39H1 is a methyltransferase which is specific for
lysine 9 of histone H3 (Rea et al (2000) Nature 406 593-599).
Methylation of lysine 9 by Suv39H1 leads to the
recruitment of the HP1 repressor protein (Bannister et al
(2001) Nature 410 120-124, Lachner et al (2001) Nature 410 116-120)
and the formation of transcriptionally silent
heterochromatin.

Suv39H1 has homologues in human (Suv39H2), Drosophila
(su(var)3-9) and S. pombe (clr4). Defects in Suv39H1 and
its homologues have been correlated with defects in
transcription and aberrant mitotic division and
chromosomal mis-segregation.



Repression of the Suv39h1 protein has also been shown to
inhibit cell proliferation of mammalian cells and its
phosphorylation status is known to change as cells
progress from G1 to S.

The Suv39h1 methyltransferase comprises an evolutionarily
conserved domain called a SET domain (Su(var)3-9, Enhancer of
Zeste and Trithorax). However, SET domains are also found
in other proteins which do not have histone methylase
activity (Rea et al. (2000)).

A histone methylase activity has also been identified in
yeast Clr4, and this activity has been shown to be essential
for transcriptional repression (Bannister et al (2001)
Nature 410 120-124). Lack of Clr4 methylase activity
activates heterochromatically repressed genes at
centromeres and may lead to chromosome mis-segregation.

The present inventors have discovered that a group of
previously uncharacterised SET domain containing proteins
are histone H3 methylases. This group includes S.
cerevisiae SET1 (YHR119W) and its human homologues
hSET1.1P (KIAA0339) and hSET1.2P (KIAA0339) and S.
cerevisiae SET2 (YJL168C) and its human homologues hSET2
(AJ238403) and WHSC1 (XP_003388).

The present inventors have further discovered that these
methylases have activities which are distinct from
Suv39h1 and other previously characterised histone
methylases. SET1 and related proteins specifically
methylate lysine 4 of histone H3 while SET2 and related
proteins specifically methylate a lysine residue outside
the amino tail region of histone H3 (i.e. a core lysine
residue).

Various aspects of the present invention provide for the
use of a SET polypeptide and a histone H3 polypeptide in
screening methods and assays for agents which modulate
methylation of histone H3 by the SET polypeptide, and
which may therefore be useful in modulating cellular
proliferation, for example in treating conditions such as
cancer.

In a general aspect, the present invention provides an
assay method for an agent with ability to modulate, e.g.
disrupt, interfere with, or increase interaction and/or
binding of histone H3 with a SET polypeptide, the method
including:

(a) bringing into contact a SET polypeptide and a
histone H3 polypeptide; and,

(b) determining binding and/or interaction between the
histone H3 polypeptide and the SET polypeptide.

An assay may be carried out in the presence of a test
compound under conditions in which, in the absence of the
test compound, the SET polypeptide will interact or bind
with histone H3.

The precise format of the assay of the invention may be
varied by those of skill in the art using routine skill
and knowledge. For example, the interaction between the
polypeptides may be studied in vitro by labelling one
with a detectable label and bringing it into contact with
the other which has been immobilised on a solid support.
Suitable detectable labels include 35S-methionine which
may be incorporated into recombinantly produced peptides
and polypeptides. Recombinantly produced peptides and
polypeptides may also be expressed as a fusion protein
containing an epitope which can be labelled with an
antibody.

Fusion proteins may be generated that incorporate six
histidine residues at either the N-terminus or C-terminus
of the recombinant protein. Such a histidine tag may be
used for purification of the protein by using
commercially available columns which contain a metal ion,
either nickel or cobalt (Clontech, Palo Alto, CA, USA).
These tags also serve for detecting the protein using
commercially available monoclonal antibodies directed
against the six histidine residues (Clontech, Palo Alto,
CA, USA).

A protein may be immobilized on a solid support using an
antibody against that protein bound to a solid support or
via other technologies which are known per se. A
preferred in vitro interaction may utilise a fusion
protein including glutathione-S-transferase (GST). This
may be immobilized on glutathione agarose beads. In an
in vitro assay format of the type described above, a test
compound can be assayed by determining its ability to
diminish the amount of labelled peptide or polypeptide
which binds to the immobilized GST-fusion polypeptide.
This may be determined by fractionating the
glutathione-agarose beads by SDS-polyacrylamide gel electrophoresis.
Alternatively, the beads may be rinsed to remove unbound
protein and the amount of protein which has bound can be
determined by counting the amount of label present in,
for example, a suitable scintillation counter.

An assay according to the present invention may also take
the form of an in vivo assay. The in vivo assay may be
performed in a cell line such as a yeast strain in which
the relevant polypeptides or peptides are expressed from
one or more vectors introduced into the cell.

Various methods and uses of modulators which inhibit,
potentiate, increase or stimulate methylation of histone
H3 by a SET polypeptide are provided as further aspects
of the present invention.

The purpose of disruption, interference with or
modulation of the methylation of histone H3 by a SET
polypeptide may be to modulate cellular functions, such as
transcription and proliferation, which are mediated by
virtue of such methylation, as
discussed above and further below.

A method of screening for a substance which modulates
activity of a SET polypeptide may include contacting one
or more test substances or agents with the SET
polypeptide in a suitable reaction medium, testing the
activity of the treated polypeptide and comparing that
activity with the activity of the polypeptide in
comparable reaction medium untreated with the test
substance, substances or agent. A difference in activity
between the treated and untreated SET polypeptides is
indicative of a modulating effect of the relevant test
substance or substances.
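
The comparison of treated and untreated activity described above can be illustrated with a minimal sketch. The function names and the 20% decision threshold are hypothetical choices made for illustration only; the text does not specify how large a difference must be before a substance is regarded as a modulator.

```python
def percent_change(treated: float, untreated: float) -> float:
    """Change in methylase activity of the treated sample, expressed as a
    percentage of the untreated control (negative values mean inhibition)."""
    if untreated <= 0:
        raise ValueError("untreated control activity must be positive")
    return 100.0 * (treated - untreated) / untreated


def classify(treated: float, untreated: float, threshold_pct: float = 20.0) -> str:
    """Label a test substance from a single treated/untreated pair.

    threshold_pct is an assumed cut-off, not a value taken from the text.
    """
    change = percent_change(treated, untreated)
    if change <= -threshold_pct:
        return "inhibitor"
    if change >= threshold_pct:
        return "potentiator"
    return "no clear effect"


# Example with arbitrary activity units (e.g. scintillation counts).
print(percent_change(500.0, 1000.0))
print(classify(1500.0, 1000.0))
```

In practice the cut-off would be set from the variability of replicate control reactions rather than fixed in advance.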



Thus another aspect of the present invention provides an
assay method for an agent with ability to modulate, e.g.
disrupt, interfere with, increase or stimulate the
histone H3 methylase activity of a SET polypeptide, the
method including:

(a) bringing into contact a SET polypeptide and a test
compound; and,

(b) determining the histone H3 methylase activity of the
SET polypeptide.

An assay may be carried out under conditions in which, in 
the absence of the test compound, the SET polypeptide 
will possess histone H3 methylase activity. Histone H3 
methylase activity may be determined as described herein. 

The SET polypeptide and the test compound may, for
example, be brought into contact in the presence of a
histone H3 polypeptide. Histone H3 methylase activity may
then be determined by determining the presence of one or
more methyl groups at a lysine residue of the histone H3
polypeptide.

In another aspect, the present invention provides a
method for identifying and/or obtaining an agent with
ability to modulate, e.g. disrupt, inhibit, interfere
with, increase or stimulate methylation of histone H3 by
a SET polypeptide, the method including:

(a) bringing into contact a SET polypeptide and a
histone H3 polypeptide in the presence of a test
compound; and,

(b) determining methylation of the histone H3
polypeptide.



An assay may be carried out under conditions in which, in
the absence of the test compound, the SET polypeptide
will methylate histone H3.

Determining methylation of the histone H3 polypeptide may
comprise determining the presence or absence of one or
more methyl groups on a lysine residue of said histone H3
polypeptide, for example lysine 4 or a core lysine
residue.

Where the SET polypeptide is a SET1 polypeptide,
methylation of the lysine 4 residue of histone H3 may be
determined. Where the SET polypeptide is a SET2
polypeptide, methylation of a lysine residue selected
from the group consisting of lysine 27, lysine 36, lysine
37, lysine 42, lysine 56, lysine 64, lysine 79, lysine
115, lysine 121, lysine 122 and lysine 125 within the
core of histone H3 may be determined. More preferably,
where the SET polypeptide is a SET2 polypeptide,
methylation of lysine 36 may be determined.
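
The residue assignments above can be summarised as a small lookup. This is only a sketch: the dictionary and function names are hypothetical, and the tail/core boundary of 26 residues is taken from the statements elsewhere in this document that the amino tail comprises residues 1 to 26.

```python
TAIL_LENGTH = 26  # amino tail = residues 1 to 26, per the text

# Candidate histone H3 lysines for each family, as listed in the text.
TARGET_LYSINES = {
    "SET1": (4,),
    "SET2": (27, 36, 37, 42, 56, 64, 79, 115, 121, 122, 125),
}

# Residue singled out in the text for each family.
PREFERRED = {"SET1": 4, "SET2": 36}


def region(position: int) -> str:
    """Return 'tail' for residues 1..26 of histone H3 and 'core' otherwise."""
    if position < 1:
        raise ValueError("residue positions are 1-based")
    return "tail" if position <= TAIL_LENGTH else "core"


def candidate_sites(family: str):
    """List (position, region) pairs to examine for a given SET family."""
    return [(pos, region(pos)) for pos in TARGET_LYSINES[family]]


print(candidate_sites("SET1"))
print(candidate_sites("SET2")[:3])
```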

A test compound which increases, potentiates, stimulates,
disrupts, reduces, interferes with or wholly or partially
abolishes methylation of the histone H3 polypeptide and
which may thereby modulate SET polypeptide activity, may
thus be identified and/or obtained.

Methylation may be determined according to known methods
described herein.

Agents which increase or potentiate methylation of
histone H3 may be identified using conditions which, in
the absence of a positively-testing agent, prevent
methylation. Such agents may be used to potentiate the
function of a SET polypeptide and may have an effect, for
example, on transcription and/or DNA replication.

In methods of the present invention, a histone H3
polypeptide may be brought into contact with a SET
polypeptide in the presence of a suitable substrate such
as SAM (S-adenosyl-(methyl)-L-methionine). The substrate
may be labelled, e.g. (S-adenosyl-(methyl-14C)-L-methionine),
and the amount of label on the histone H3
after incubation with SET polypeptide determined.

Methods described herein may be useful in determining the
presence of, and optionally quantifying, the amount of
SET polypeptide in a test sample. This may have a
diagnostic or prognostic purpose, e.g. in the diagnosis
or prognosis of any medical condition discussed herein
(e.g. a proliferative disorder such as cancer) or in the
evaluation of a therapy to treat such a condition.

The characterisation of the histone H3 methylase activity
of a SET polypeptide as described herein and its role in
the regulation of cellular proliferation and
transcriptional activity allows the use of materials and
methods, such as are disclosed and discussed above, for
establishing the presence or absence in a test sample,
for example, obtained from an individual, of aberrant,
i.e. increased, reduced or abolished, SET polypeptide
histone H3 methylase activity. Such aberrant activity may
be determined as described herein for the purpose of
diagnosing a predisposition of an individual to a
condition associated with cellular proliferation or for
diagnosing an individual as suffering from a condition
associated with cellular proliferation, such as cancer.
The presence of aberrant SET polypeptide histone H3
methylase activity is indicative of the individual
having a condition associated with cellular proliferation
or being predisposed, i.e. having an increased
susceptibility, to a condition associated with cellular
proliferation.



Aberrant expression may be detected at the protein level,
by determining the histone H3 methylase activity of a SET
polypeptide, as described herein, for example, the
presence or absence or amount of methylase activity, or at
the nucleic acid level (i.e. DNA or RNA), by determining
the presence of a mutant, variant or allele of a SET gene
which encodes a SET polypeptide which has aberrant
activity or which expresses aberrant, i.e. abolished,
reduced or increased, levels of SET polypeptide. The
presence or amount of SET polypeptide expression may be
determined by determining the presence and/or amount of
mRNA encoding the polypeptide.

In assay methods according to the invention, the amount
of test substance or compound which may be added to an
assay of the invention will normally be determined by
trial and error depending upon the type of compound used.
Typically, from about 0.001 nM to 1 mM or more
concentrations of putative inhibitor compound may be
used, for example from 0.01 nM to 100 µM, e.g. 0.1 to 50
µM, such as about 10 µM. Greater concentrations may be
used when a peptide is the test substance. Even a
molecule which has a weak effect may be a useful lead
compound for further investigation and development.
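
Because the concentration range quoted above spans many orders of magnitude, test concentrations are conveniently spaced logarithmically. The sketch below generates such a series; the function name and the choice of one point per decade are illustrative assumptions and are not prescribed by the text.

```python
import math


def log_dilution_series(low_molar: float, high_molar: float,
                        points_per_decade: int = 1) -> list:
    """Return concentrations (in mol/l) spaced evenly on a log10 scale,
    running from low_molar up to high_molar inclusive."""
    if not (0 < low_molar < high_molar):
        raise ValueError("require 0 < low_molar < high_molar")
    if points_per_decade < 1:
        raise ValueError("points_per_decade must be at least 1")
    decades = math.log10(high_molar / low_molar)
    steps = round(decades * points_per_decade)
    return [low_molar * 10 ** (i / points_per_decade) for i in range(steps + 1)]


# 0.001 nM (1e-12 M) to 1 mM (1e-3 M), one concentration per decade.
series = log_dilution_series(1e-12, 1e-3)
for conc in series:
    print(f"{conc:.0e} M")
```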

A test substance, compound or agent used in an assay
method as described herein may be comprised in a sample,
mixture or extract, for example, a biological sample.

A screening or assay method may include purifying and/or
isolating a test substance and/or substance of interest
from a mixture or extract, i.e. reducing the content of
at least one component of the mixture or extract, e.g. a
component with which the test substance is naturally
associated. The screening or assay method may include
determining the ability of one or more fractions of a
test mixture or extract to modulate the methylase
activity of the SET polypeptide.

The purification and/or isolation may employ any method
known to those skilled in the art.

The precise format of any of the screening or assay
methods of the present invention may be varied by those
of skill in the art using routine skill and knowledge.
The skilled person is well aware of the need to employ
appropriate control experiments.

Compounds which may be screened using the assay methods
described herein may be natural or synthetic chemical
compounds used in drug screening programmes. Extracts of
plants, microbes or other organisms, which contain
several characterised or uncharacterised components, may
also be used.



Combinatorial library technology provides an efficient
way of testing a potentially vast number of different
substances for ability to modulate an interaction. Such
libraries and their use are known in the art, for all
manner of natural products, small molecules and peptides,
among others. The use of peptide libraries may be
preferred in certain circumstances.



Methods of determining the methylation of histone H3 by a
SET polypeptide and of screening for an agent able to
modulate the methylation of histone H3 by a SET
polypeptide include methods in which a suitable
end-point is used to assess interaction.

Where the SET polypeptide is a SET1 polypeptide, suitable
end points include the determination of the methylation
of the lysine 4 residue of histone H3 using methods as
described herein.

Where the SET polypeptide is a SET2 polypeptide, suitable
end points include the determination of the methylation
of histone H3 outside the N terminal 26 amino acids (i.e.
at a core lysine residue) using methods as described
herein, for example methylation of lysine 36.

Methylation may be determined by any convenient method
known to a skilled person. For example, a histone H3
polypeptide or variant or derivative thereof may be
immobilised, e.g. on a bead or plate, and methylation of
the appropriate residue detected using an antibody or
other binding molecule which binds the N terminal region
of histone H3 with a different affinity when the residue
is methylated from when the residue is not methylated.
Antibodies may also be used to determine the level of
methylation, e.g. di- or tri-methylation, of a histone H3
lysine residue. Antibodies may be obtained by means of
any standard technique as discussed elsewhere herein,
e.g. using a methylated peptide (such as an N terminal
fragment of histone H3 or a fragment with the N terminal
amino acids deleted).



Binding of a molecule which discriminates between the
methylated and non-methylated form of a histone H3
polypeptide or between different degrees of methylation
may be assessed using any technique available to those
skilled in the art, which may involve determination of
the presence of a suitable label.

Methylation may also be assayed in solution, e.g. as
described in Rea et al (2000), Nature, 406: 593-599.
Briefly, 10 µg of free histone substrate (mixture of H1,
H2, H3, and H4; Boehringer Mannheim) is mixed with 300 nCi
S-adenosyl-[methyl-14C]-L-methionine (25 µCi ml-1)
(Amersham) as methyl donor in methylase activity buffer
(50 mM Tris.HCl pH 8.5, 20 mM KCl, 10 mM MgCl2, 10 mM
β-mercaptoethanol, 250 mM sucrose), to give a final volume
of 50 µl. 10 µg of SET polypeptide preparation is added
and the reaction incubated at 37°C for 60 mins. The
reaction products are then resolved by SDS-PAGE and
viewed following fluorography of the gel. Alternatively,
following SDS-PAGE, the resolved proteins can be Western
blotted to a nitrocellulose membrane, which is then dried
and exposed to film.
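
The quantities in the in-solution assay above imply a simple pipetting calculation, sketched below. The helper name is hypothetical, and the sketch assumes that the 50 µl final volume is the total reaction volume; the text does not state whether the SET polypeptide preparation is included in that figure.

```python
def stock_volume_ul(activity_nci: float, stock_uci_per_ml: float) -> float:
    """Volume of radiolabelled SAM stock (in µl) that delivers activity_nci.

    1 µCi/ml equals 1 nCi/µl, so the stock concentration can be used
    directly as nCi per µl.
    """
    if stock_uci_per_ml <= 0:
        raise ValueError("stock concentration must be positive")
    stock_nci_per_ul = stock_uci_per_ml
    return activity_nci / stock_nci_per_ul


FINAL_VOLUME_UL = 50.0                            # final reaction volume from the text
sam_ul = stock_volume_ul(300.0, 25.0)             # 300 nCi from a 25 µCi/ml stock
remaining_ul = FINAL_VOLUME_UL - sam_ul           # left for histones, enzyme and buffer
histone_ug_per_ul = 10.0 / FINAL_VOLUME_UL        # 10 µg histone substrate in 50 µl

print(f"SAM stock: {sam_ul:.1f} µl")
print(f"Other components: {remaining_ul:.1f} µl")
print(f"Histone substrate: {histone_ug_per_ul:.2f} µg/µl")
```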


Methylation may also be determined as described below in 
the examples. 

Of course, the person skilled in the art will design any
appropriate control experiments with which to compare
results obtained in test assays.

A histone H3 polypeptide may be a full-length histone H3
protein from a eukaryotic cell, such as a yeast or a
mammal, for example a human. The term also includes
fragments of the full-length protein sequence, such as
fragments which comprise the lysine 4 residue, for
example, fragments comprising the N terminal residues of
the full length protein. Other fragments may comprise
residues from the core of histone H3, i.e. outside the
amino tail region, such as fragments which comprise a
lysine residue selected from the group consisting of
lysine 27, lysine 36, lysine 37, lysine 42, lysine 56,
lysine 64, lysine 79, lysine 115, lysine 121, lysine 122
and lysine 125, for example, a fragment comprising lysine
36.

A SET polypeptide may be a full-length SET protein or a
fragment thereof which retains the methylase activity of
the full length SET protein, for example a polypeptide
which comprises the SET domain.

Examples of SET domains are shown in Figures 1 and 2. A
SET domain may comprise three conserved motifs: NHSC,
GEX5Y and Erich (Rea S. et al (2000) Nature 406 593-599).
These motifs are indicated in Figure 1 and Figure
2.

A SET domain of a SETl polypeptide may further comprise 
conserved motifs I, II, III, IV and V as indicated in 
Figure 1 and Figure 2. 

A SET polypeptide may be a member of the SET1 or the SET2
families of histone H3 methylases (HMTs).

SET1 polypeptides specifically methylate the lysine 4
residue of histone H3 and include S. cerevisiae SET1
(Database Accession Number YHR119W), S. pombe SET1
(SPCC306.04C), hSET-1P (Human: KIAA0339), hSET-2P (Human:
KIAA1076), HRX, TRX2, KIAA1090, and ALR (all on public
databases). SET1 polypeptides may comprise a SET domain
having greater than 35%, greater than 40%, greater than
45% or greater than 50% sequence identity with the SET
domain of S. cerevisiae SET1.

An aspect of the present invention provides a SET1
polypeptide which methylates lysine 4 of a histone H3
polypeptide.

SET2 polypeptides specifically methylate a lysine residue
selected from the group consisting of lysine 27, lysine
36, lysine 37, lysine 42, lysine 56, lysine 64, lysine
79, lysine 115, lysine 121, lysine 122 and lysine 125,
i.e. a core lysine of histone H3 outside the N terminal
26 amino acid tail. More preferably, a SET2 polypeptide
specifically methylates lysine 36. SET2 polypeptides
include S. cerevisiae SET2 (YJL168C), S. pombe SET2
(SPAC29B12), hSET2 (AJ238403), WHSC1 (XP_003388), Q9H6H8,
NSD1, NSD3 and ASH1 (all on public databases).

Another aspect of the present invention provides a SET2
polypeptide which methylates a core lysine residue of a
histone H3 polypeptide.

SET2 polypeptides may comprise a SET domain having
greater than 35%, greater than 40%, greater than 45% or
greater than 50% sequence identity with the SET domain of
S. cerevisiae SET2.

A histone H3 or SET polypeptide may include any suitable
fragment, variant, derivative, allele or homologue of
histone H3 or the SET polypeptide, which may be employed
in a method described herein. Suitable fragments,
variants or derivatives of histone H3 retain the
biological activity of being methylated by a SET
polypeptide, i.e. they include the appropriate sites of
methylation by the SET polypeptide, for example, lysine 4
or other target residues within the histone H3 core
(outside the amino tail residues 1 to 26), for example
lysine 36. Suitable variants or derivatives of a SET
polypeptide retain the histone H3 methylase activity.

A polypeptide which is an amino acid sequence variant,
allele, derivative or mutant of an amino acid sequence
described herein may comprise an amino acid sequence
which shares greater than about 60% sequence identity
with the sequence shown, greater than about 70%, greater
than about 80%, greater than about 90% or greater than
about 95%. The sequence may share greater than about 70%
similarity, greater than about 80% similarity, greater
than about 90% similarity or greater than about 95%
similarity with the amino acid sequence described herein.
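
The identity thresholds above can be checked against a pair of already aligned sequences as sketched here. The function names are hypothetical, and the convention of dividing by the number of alignment columns (gap columns included) is an assumption; alignment programs differ on the denominator they report.

```python
IDENTITY_THRESHOLDS = (60, 70, 80, 90, 95)  # percentages listed in the text


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two aligned sequences of equal length.

    '-' marks a gap. Columns where both sequences have a gap are ignored;
    all other columns count towards the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def thresholds_exceeded(identity_pct: float) -> list:
    """Return the listed thresholds that identity_pct is strictly greater than."""
    return [t for t in IDENTITY_THRESHOLDS if identity_pct > t]


identity = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(identity, thresholds_exceeded(identity))
```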

For amino acid "homology", this may be understood to be
similarity (according to the established principles of
amino acid similarity, e.g. as determined using the
algorithm GAP (as described below)) or identity.

Amino acid similarity is generally defined with reference
to the algorithm GAP (Accelrys, formerly Genetics
Computer Group, Madison, WI). GAP uses the Needleman and
Wunsch algorithm to align two complete sequences in a way that
maximizes the number of matches and minimizes the number
of gaps. Generally, the default parameters are used,
with a gap creation penalty = 12 and gap extension
penalty = 4. Use of GAP may be preferred but other
algorithms may be used, e.g. BLAST (which uses the method
of Altschul et al. (1990) J. Mol. Biol. 215: 405-410),
FASTA (which uses the method of Pearson and Lipman (1988)
PNAS USA 85: 2444-2448), or the Smith-Waterman algorithm
(Smith and Waterman (1981) J. Mol. Biol. 147: 195-197),
generally employing default parameters.
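
The global alignment principle named above can be illustrated with a deliberately simplified Needleman-Wunsch scoring routine. This is not the GAP program: it scores simple identity instead of using an amino acid substitution matrix, and it applies a single linear gap penalty rather than the separate gap creation and gap extension penalties quoted in the text. The function name and the default scores are illustrative assumptions.

```python
def needleman_wunsch_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -2) -> int:
    """Optimal global alignment score of sequences a and b.

    Uses dynamic programming over a (len(a)+1) x (len(b)+1) grid, keeping
    only the previous row in memory. Every gap position costs `gap`.
    """
    # Row 0: aligning an empty prefix of a against prefixes of b.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        # Column 0: aligning a prefix of a against an empty prefix of b.
        curr = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            gap_in_b = prev[j] + gap       # a[i-1] aligned to a gap
            gap_in_a = curr[j - 1] + gap   # b[j-1] aligned to a gap
            curr[j] = max(diagonal, gap_in_b, gap_in_a)
        prev = curr
    return prev[-1]


print(needleman_wunsch_score("GATTACA", "GATTACA"))
print(needleman_wunsch_score("GATTACA", "GATACA"))
```

Extending the sketch to affine gap penalties would require tracking separate matrices for alignments that end in a gap, which is omitted here for brevity.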



Similarity allows for "conservative variation", i.e.
substitution of one hydrophobic residue such as
isoleucine, valine, leucine or methionine for another, or
the substitution of one polar residue for another, such
as arginine for lysine, glutamic for aspartic acid, or
glutamine for asparagine. Particular amino acid sequence
variants may differ from a sequence described herein by
insertion, addition, substitution or deletion of 1 amino
acid, 2, 3, 4, 5-10, 10-20, 20-30, 30-50, 50-100, 100-150,
or more than 150 amino acids.
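
The examples of conservative variation given above can be encoded as residue groups, as in the following sketch. Only the pairings named in the text are included; the group table, function names and the toy sequences are illustrative assumptions, and a full similarity calculation would normally rely on a substitution matrix instead.

```python
# One-letter codes for the residue groups named in the text.
CONSERVATIVE_GROUPS = (
    frozenset("IVLM"),  # hydrophobic: isoleucine, valine, leucine, methionine
    frozenset("RK"),    # arginine / lysine
    frozenset("ED"),    # glutamic acid / aspartic acid
    frozenset("QN"),    # glutamine / asparagine
)


def is_conservative(original: str, replacement: str) -> bool:
    """True if replacing `original` with `replacement` stays within one group.

    Identical residues are not substitutions and return False.
    """
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def count_substitutions(reference: str, variant: str):
    """Return (conservative, other) substitution counts for two
    equal-length, ungapped sequences."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    conservative = other = 0
    for a, b in zip(reference.upper(), variant.upper()):
        if a == b:
            continue
        if is_conservative(a, b):
            conservative += 1
        else:
            other += 1
    return conservative, other


# Toy sequences, not taken from any real protein.
print(count_substitutions("MKVLE", "MRVLA"))
```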

Sequence comparison may be made over the full-length of
the relevant sequence shown herein, or may more
preferably be over a contiguous sequence of about or
greater than about 20, 25, 30, 33, 40, 50, 67, 133, 167,
200, 233, 267, 300, 333, 400 or more amino acids,
compared with the relevant amino acid sequence.

As described above, peptides which include or consist of
fragments of a full-length SET protein are encompassed
within the term 'SET polypeptide' as defined herein.
Where the SET polypeptide is a fragment of the
full-length protein, suitable fragments are generally those
which retain the methylase activity of the full-length
polypeptide and are capable of methylating histone H3,
except where otherwise stated.



A further aspect of the present invention provides a
fragment of a full-length SET polypeptide as described
herein, said fragment having histone H3 methylase
activity.

Preferably such a fragment comprises the SET methylase
domain.

A suitable fragment preferably methylates histone H3
at a residue other than lysine 9, more preferably at lysine 4 or a lysine
residue selected from the group consisting of lysine 27,
lysine 36, lysine 37, lysine 42, lysine 56, lysine 64,
lysine 79, lysine 115, lysine 121, lysine 122 and lysine
125, i.e. a core lysine residue of histone H3 located
outside the first 26 N terminal amino acids, for example
in the central and C terminal regions, such as lysine 36.

A fragment of a SET polypeptide includes a fragment of a
SET1 polypeptide, which may be any member of the SET1
family, including hSET-1P (human KIAA0339) and hSET-2P
(human KIAA1076), and a fragment of a SET2 polypeptide,
which may be any member of the SET2 protein family,
including hSET2-1, hSET2-2, hSET2-3 and WHSC1.

Another aspect of the present invention provides an
isolated nucleic acid molecule comprising a nucleotide
sequence which encodes a SET polypeptide. A SET
polypeptide may include a fragment of a full-length SET
polypeptide sequence described herein.

Such a nucleotide sequence may be operably linked to a
heterogeneous regulatory element, such as a promoter or
enhancer element. A nucleotide sequence encoding a SET
polypeptide as described above may be comprised within a
vector, in particular an expression vector.



Vectors comprising nucleic acid encoding a SET
polypeptide may be transformed into a suitable host cell
as described above to provide for expression of the SET
polypeptide. Thus, in a further aspect the invention
provides a process for preparing a SET polypeptide which
includes cultivating a host cell transformed or
transfected with an expression vector as described above
under conditions to provide for expression by the vector
of a coding sequence encoding the SET polypeptide, and
recovering the expressed polypeptide. Polypeptides may
also be expressed in in vitro systems, such as
reticulocyte lysate.

Following production of a SET polypeptide, it may be
tested for histone H3 methyltransferase activity, e.g. by
determination of methylation of lysine 4 or a core lysine
residue of histone H3 selected from the group consisting
of lysine 27, lysine 36, lysine 37, lysine 42, lysine 56,
lysine 64, lysine 79, lysine 115, lysine 121, lysine 122
and lysine 125, preferably lysine 36, as described herein,
on incubation of the polypeptide with a histone H3
polypeptide.

Thus the present invention also provides a method of
producing a SET polypeptide comprising:

expressing a SET polypeptide from encoding nucleic
acid, and;

determining the methyltransferase activity of the
expressed SET polypeptide.

The methyltransferase activity may include the
methylation of a histone H3 residue other than lysine 9,
for example lysine 4 or a lysine residue selected from
the group consisting of lysine 27, lysine 36, lysine 37,
lysine 42, lysine 56, lysine 64, lysine 79, lysine 115,
lysine 121, lysine 122 and lysine 125, i.e. a core lysine
residue of histone H3 located outside the first 26 N
terminal amino acids, for example lysine 36.
Methyltransferase activity may also include the di- or
tri-methylation of such a residue.

It is not necessary to use the entire proteins for assays
of the invention which test for binding between two
molecules. Fragments may be generated and used in any
suitable way known to those of skill in the art.

Suitable ways of generating fragments include, but are
not limited to, recombinant expression of a fragment from
encoding DNA. Such fragments may be generated by taking
encoding DNA, identifying suitable restriction enzyme
recognition sites either side of the portion to be
expressed, and cutting out said portion from the DNA.
The portion may then be operably linked to a suitable
promoter in a standard commercially available expression
system. Another recombinant approach is to amplify the
relevant portion of the DNA with suitable PCR primers.
Small fragments (e.g. up to about 20 or 30 amino acids)
may also be generated using peptide synthesis methods
which are well known in the art.
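
As an illustration of the PCR approach mentioned above, the following sketch derives a primer pair flanking the DNA that encodes a chosen amino acid range. It is a minimal example under stated assumptions: the function names are hypothetical, the coding sequence is a made-up toy sequence, the reading frame is assumed to start at the first base, and practical primer design would additionally consider melting temperature, added restriction sites and start or stop codons.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string containing only A, C, G and T."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def fragment_primers(cds: str, first_aa: int, last_aa: int, primer_len: int = 18):
    """Return (forward_primer, reverse_primer, fragment) for the part of the
    coding strand `cds` that encodes residues first_aa..last_aa (1-based,
    inclusive). Both primers are written 5' to 3'."""
    start = (first_aa - 1) * 3   # index of the first base of the first codon
    end = last_aa * 3            # index just past the last base of the last codon
    if not (1 <= first_aa <= last_aa) or end > len(cds):
        raise ValueError("amino acid range lies outside the coding sequence")
    fragment = cds[start:end].upper()
    if len(fragment) < primer_len:
        raise ValueError("fragment is shorter than the requested primer length")
    forward = fragment[:primer_len]                        # matches the coding strand
    reverse = reverse_complement(fragment[-primer_len:])   # anneals to the coding strand
    return forward, reverse, fragment


# Toy 10-codon coding sequence, for illustration only.
toy_cds = "ATGGCTCGTACTAAGCAGACTGCTCGTAAG"
fwd, rev, frag = fragment_primers(toy_cds, first_aa=2, last_aa=5, primer_len=6)
print(fwd, rev, frag)
```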

The skilled person can use the techniques described 
25 herein and others well known in the art to produce large 
amounts of peptides, for instance by expression from 
encoding nucleic acid. 

Methods of obtaining agents able to modulate the histone
H3 methylation activity of SET polypeptides include
methods wherein a suitable end-point is used to assess
interaction in the presence and absence of a test
substance.

For methylation assays, a histone H3 or SET polypeptide
which is a full-length protein, truncated portion, or a
portion fused to another protein (e.g. GST), or a
suitable variant or derivative of any of these, may be
used.

Peptide methylation assays may use peptides that comprise
the region of histone H3 which is methylated. For a SET1
polypeptide, this may include the N-terminal region of
histone H3 comprising lysine 4.

For a SET2 polypeptide, this may include a lysine residue 
selected from the group consisting of lysine 27, lysine 
36, lysine 37, lysine 42, lysine 56,. lysine 64, lysine 
79, lysine 115, lysine 121, lysine 122 and lysine 125, 
i.e, a core lysine residue of histone H3 located outside 
the first 26 N terminal amino acids, for example, the 
central and C terminal regions. Preferably, for a SET2 
polypeptide, this may include lysine 36. 
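
By way of a hedged illustration, a peptide substrate comprising a chosen lysine could be picked out of a histone H3 sequence as sketched below in Python. The 40-residue sequence is an assumption (the commonly cited N-terminal region of human histone H3, numbered from alanine 1) and is not reproduced from this specification; the function name and the default flank width are likewise hypothetical.

```python
# Assumed N-terminal 40 residues of human histone H3, numbered from Ala1.
# This sequence is an illustrative assumption, not part of the source text.
H3_TAIL = "ARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHR"


def peptide_around(sequence, position, flank=7):
    """Return (start, end, peptide) for a window around a lysine.

    position is 1-based. The window extends up to `flank` residues on
    each side and is clipped at the ends of the sequence.
    """
    if not 1 <= position <= len(sequence):
        raise ValueError("position lies outside the sequence")
    if sequence[position - 1] != "K":
        raise ValueError("residue at this position is not a lysine")
    start = max(1, position - flank)
    end = min(len(sequence), position + flank)
    return start, end, sequence[start - 1:end]


print(peptide_around(H3_TAIL, 4))            # window containing lysine 4
print(peptide_around(H3_TAIL, 36, flank=5))  # window containing lysine 36
```

The check on the residue type guards against numbering mistakes, which are easy to make when sequences from different species or with a retained initiator methionine are used.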

The methylation of histone H3 may be assayed by any of a 
variety of procedures such as discussed below and may be 
adapted to high throughput screening approaches. Of
particular interest is the methylation of the lysine 4
residue of histone H3 by hSET-1P (human KIAA0339) and/or
hSET-2P (human KIAA1076) and methylation in the central
and/or C terminal portions (i.e. the core) of histone H3
by hSET2 (AJ238403) and/or WHSC1 (XP_003388).

A method of the present invention may comprise 
identifying and/or obtaining a test compound which
modulates the histone H3 methylase activity of a SET 
polypeptide.

Following identification of a test compound which is an
agent or modulator as described herein, it may be
investigated further. For example, a compound, substance
or molecule which tests positive for ability to modulate
methylation of the appropriate residue of histone H3
and/or the methylase activity of a SET polypeptide may be
isolated, purified and/or manufactured. A method of the
present invention may thus comprise isolating and/or
purifying and/or manufacturing or synthesising the test
compound.

Furthermore, the test compound may be used in
preparation, i.e. manufacture or formulation, of a
composition such as a medicament, pharmaceutical
composition or drug. This may be administered to
individuals. A method of the present invention may thus
comprise formulating said compound with a
pharmaceutically acceptable excipient as described
herein.

In various aspects, the present invention thus provides a
modulator identified and/or obtained by a screening
method of the invention, e.g. a substance which inhibits
or reduces, increases or potentiates the histone H3
methylase activity of a SET polypeptide.

Following identification of a modulator, the substance 
may be purified and/or investigated further and/or
manufactured. A modulator may be used to obtain peptidyl
or non-peptidyl mimetics, e.g. by methods well known to
those skilled in the art and discussed herein. It may be 
used in a therapeutic context as discussed below.

Agents according to the present invention useful in
modulating the methylation of histone H3 and therefore
one or more of its functions within the chromatin, may
modulate the methylase activity of the SET polypeptide.
Such agents may specifically inhibit the ability of the
SET polypeptide to methylate the appropriate residue of
histone H3 or provide the appropriate level of
methylation. Assays and screens for such agents are
provided in accordance with the present invention, along
with the agents themselves and their use in modulating
the methylation and thereby the function of histone H3.

An agent able to inhibit methylation of histone H3 by a
SET polypeptide may include a substance able to affect
the catalytic properties of the enzymatically active site
of the methylase. An inhibitor of methylation may
interact with the SET polypeptide within the SET
methylase domain. Residues within this domain are
involved in interaction with histone H3 and catalysis of
the methylation. Residues outside of the domain may also
be involved in interacting with histone H3 and agents
which interfere with such interaction may also affect the
methylation as discussed elsewhere herein.

Agents useful in accordance with the present invention
may be identified by screening techniques which involve
determining whether an agent under test inhibits or
disrupts the methylation by a SET polypeptide of the
histone H3 polypeptide. 
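
A minimal sketch of how such a screen might be scored is given below in Python, assuming that the assay end-point is a numerical signal (for example, incorporated label) measured with and without the test agent. The function names, the background correction and the 50% threshold are illustrative assumptions rather than part of the methods described herein.

```python
def percent_inhibition(signal_without, signal_with, background=0.0):
    """Percent reduction in methylation signal caused by the test agent.

    Negative values indicate that the signal increased in the presence
    of the agent.
    """
    control = signal_without - background
    treated = signal_with - background
    if control <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (control - treated) / control


def classify(signal_without, signal_with, background=0.0, threshold=50.0):
    """Label an agent using an arbitrary, assumed threshold (in percent)."""
    pct = percent_inhibition(signal_without, signal_with, background)
    if pct >= threshold:
        return "inhibitor"
    if pct <= -threshold:
        return "activator"
    return "no clear effect"


print(percent_inhibition(1000.0, 250.0))  # hypothetical readings
print(classify(1000.0, 250.0))
```

A real screen would normally use replicate measurements and appropriate controls; the single-point comparison here is only meant to make the presence/absence logic concrete.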



One class of putative modulator compounds can be derived 
from the SET polypeptide sequence and/or a ligand with
which it interacts, such as histone H3. Peptide fragments
of these polypeptides or alleles, mutants or derivatives
of such fragments are described herein. Nucleic acid
encoding such peptides, vectors and host cells containing
such nucleic acid, and methods of expressing nucleic acid
encoding such peptides are further aspects of the present
invention.

Where the histone H3 polypeptide is a fragment of the 
full length protein, suitable fragments are those which 
contain the appropriate residue which is methylated by a
SET polypeptide in the full-length polypeptide, and are
themselves capable of being methylated by a SET
polypeptide. Smaller fragments, and analogues and
variants of these fragments may similarly be employed,
e.g. as identified using techniques such as deletion
analysis or alanine scanning.
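
The two techniques named above can be sketched in Python as follows. Alanine scanning replaces each residue in turn with alanine, and a simple deletion series removes residues progressively from one end. The function names and the example peptide are hypothetical, and the sketch only generates the variant sequences; assessing each variant for methylation would be done experimentally.

```python
def alanine_scan(peptide):
    """Return (position, original_residue, variant) for each substitution.

    Positions are 1-based. Residues that are already alanine are skipped
    because substituting them would reproduce the starting peptide.
    """
    variants = []
    for i, residue in enumerate(peptide):
        if residue == "A":
            continue
        variants.append((i + 1, residue, peptide[:i] + "A" + peptide[i + 1:]))
    return variants


def n_terminal_deletions(peptide, min_length):
    """Return progressively shorter peptides truncated from the N terminus."""
    return [peptide[i:] for i in range(1, len(peptide) - min_length + 1)]


for position, residue, variant in alanine_scan("ARTKQ"):
    print(position, residue, variant)
print(n_terminal_deletions("ARTKQ", 3))
```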

Fragments methylated by a SET1 polypeptide include
fragments comprising the N terminal of histone H3, in
particular lysine 4. Fragments methylated by a SET2
polypeptide include fragments comprising a lysine residue
selected from the group consisting of lysine 27, lysine
36, lysine 37, lysine 42, lysine 56, lysine 64, lysine
79, lysine 115, lysine 121, lysine 122 and lysine 125,
i.e. a core lysine residue of histone H3 located outside
the first 26 N terminal amino acids, in particular lysine
36.

Thus, the present invention provides a SET polypeptide,
which is preferably a peptide fragment, which is able to
inhibit methylation of histone H3.



Such peptide fragments may be obtained by means of
deletion analysis and/or alanine scanning of the relevant
protein - making an appropriate mutation in sequence,
bringing together a SET polypeptide and a histone H3
polypeptide in the presence of the mutated fragment, and
determining methylation of the histone H3 polypeptide.
In preferred embodiments, the peptide is short, as
discussed below, and may be a minimal portion which is
able to interact with the relevant counterpart protein
and inhibit methylation.

A "fragment" of a polypeptide generally means a stretch
of amino acid residues of at least about five contiguous
amino acids, often at least about seven contiguous amino
acids, typically at least about nine contiguous amino
acids, more preferably at least about 13 contiguous amino
acids, and, more preferably, at least about 20 to 30 or
more contiguous amino acids. Fragments of a SET
polypeptide may include antigenic determinants or
epitopes useful for raising antibodies to a portion of
the amino acid sequence. Alanine scans are commonly used
to find and refine peptide motifs within polypeptides,
this involving the systematic replacement of each residue
in turn with the amino acid alanine, followed by an
assessment of biological activity.

Where the SET polypeptide is a SET1 polypeptide and the
histone H3 polypeptide is a fragment, the fragment
generally comprises the lysine 4 residue.

Where the SET polypeptide is a SET2 polypeptide and the
histone H3 polypeptide is a fragment, the fragment
generally comprises a lysine residue selected from the
group consisting of lysine 27, lysine 36, lysine 37,
lysine 42, lysine 56, lysine 64, lysine 79, lysine 115,
lysine 121, lysine 122 and lysine 125, i.e. a core lysine
residue of histone H3 located outside the first 26 N
terminal amino acids. More preferably the fragment
comprises lysine 36.

Peptides in accordance with the present invention tend to
be short, and may be about 40 amino acids in length or
less, preferably about 35 amino acids in length or less,
more preferably about 30 amino acids in length or less,
more preferably about 25 amino acids or less, more
preferably about 20 amino acids or less, more preferably
about 15 amino acids or less, more preferably about 10
amino acids or less, or 9, 8, 7, 6, 5 or less in length.
Peptides according to the present invention may be about
10-40 amino acids in length, about 5-10, about 10-15,
about 10-20, about 10-30, about 20-30, or about 30-40
amino acids in length. Peptides which are histone H3
fragments generally include one or more of the relevant
lysine residues.

The present invention also encompasses peptides which are 
sequence variants or derivatives of a wild-type SET
polypeptide sequence. Peptides which are variants of
wild-type SET polypeptide sequence retain the ability to
modulate methylation of histone H3 by a SET polypeptide. 

A SET polypeptide or histone H3 polypeptide may be a
peptide or polypeptide which may include an amino acid
sequence which differs by one or more amino acid residues
from the wild-type amino acid sequence, by one or more of
addition, insertion, deletion and substitution of one or
more amino acids. Thus, variants, derivatives, alleles,
mutants and homologues, e.g. from other organisms, are
included.

Preferably, the amino acid sequence shares homology with
a region of the relevant SET polypeptide or histone
H3 polypeptide as referenced herein, preferably at least
about 60%, or 70%, or 75%, or 80%, or 85%, 90% or 95%
homology. Thus, a peptide fragment of a SET polypeptide
or histone H3 polypeptide may include 1, 2, 3, 4, 5,
greater than 5, or greater than 10 amino acid alterations
such as substitutions with respect to the wild-type
sequence. 
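
To make the percentages above concrete, the short Python sketch below counts substitutions between a wild-type peptide and a variant of the same length and reports percent identity. Treating percent identity over an ungapped comparison as a stand-in for "homology" is a simplifying assumption made here, and the example sequences and function names are hypothetical.

```python
def count_alterations(wild_type, variant):
    """Count positions at which two equal-length sequences differ."""
    if len(wild_type) != len(variant):
        raise ValueError("this sketch only compares equal-length sequences")
    return sum(1 for a, b in zip(wild_type, variant) if a != b)


def percent_identity(wild_type, variant):
    """Percentage of positions that are identical (no gaps considered)."""
    if not wild_type:
        raise ValueError("sequences must not be empty")
    differences = count_alterations(wild_type, variant)
    return 100.0 * (len(wild_type) - differences) / len(wild_type)


wild_type = "ARTKQTARKSTGGKAPRKQL"  # hypothetical 20-residue peptide
variant = "ARTAQTARKSTGGKAPRKQV"    # two substitutions relative to it
print(count_alterations(wild_type, variant), percent_identity(wild_type, variant))
```

Variants involving insertions or deletions would need a proper sequence alignment, which is outside the scope of this sketch.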

A derivative of a peptide for which the specific sequence
is disclosed herein may be in certain embodiments the
same length or shorter than the specific peptide. In
other embodiments the peptide sequence or a variant
thereof may be included in a larger peptide, as discussed
above, which may or may not include an additional portion
of SET polypeptide or histone H3 polypeptide. 1, 2, 3, 4
or 5 or more additional amino acids, adjacent to the
relevant specific peptide fragment of SET polypeptide or
histone H3 polypeptide, or heterologous thereto may be
included at one end or both ends of the peptide.

Antibodies directed to the site of binding of SET 
polypeptides form another class of putative modulators of 
SET methylase activity. Candidate inhibitor antibodies
may be characterised and their binding regions determined
to provide single chain antibodies and fragments thereof 
which are responsible for disrupting the binding. 

Antibodies may be obtained using techniques which are
standard in the art. Methods of producing antibodies
include immunising a mammal (e.g. mouse, rat, rabbit,
horse, goat, sheep or monkey) with a SET polypeptide.
Antibodies may be obtained from immunised animals using
any of a variety of techniques known in the art, and
screened, preferably using binding of antibody to antigen
of interest. For instance, Western blotting techniques
or immunoprecipitation may be used (Armitage et al.,
1992, Nature 357: 80-82). Isolation of antibodies and/or
antibody-producing cells from an animal may be
accompanied by a step of sacrificing the animal.

As an alternative or supplement to immunising a mammal
with a peptide, an antibody specific for a SET
polypeptide may be obtained from a recombinantly produced
library of expressed immunoglobulin variable domains,
e.g. using lambda bacteriophage or filamentous
bacteriophage which display functional immunoglobulin
binding domains on their surfaces; for instance see
WO92/01047. The library may be naive, that is
constructed from sequences obtained from an organism
which has not been immunised with any of the proteins (or
fragments), or may be one constructed using sequences
obtained from an organism which has been exposed to the
antigen of interest.

Antibodies according to the present invention may be
modified in a number of ways. Indeed the term "antibody"
should be construed as covering any binding substance
having an immunoglobulin binding domain with the required
specificity. Thus the invention covers antibody
fragments, derivatives, functional equivalents and
homologues of antibodies, including synthetic molecules
and molecules whose shape mimics that of an antibody
enabling it to bind an antigen or epitope.



A hybridoma producing a monoclonal antibody according to
the present invention may be subject to genetic mutation
or other changes. It will further be understood by those
skilled in the art that a monoclonal antibody can be
subjected to the techniques of recombinant DNA technology
to produce other antibodies or chimeric molecules which
retain the specificity of the original antibody. Such
techniques may involve introducing DNA encoding the
immunoglobulin variable region, or the complementarity
determining regions (CDRs), of an antibody to the
constant regions, or constant regions plus framework
regions, of a different immunoglobulin. See, for
instance, EP184187A, GB 2188638A or EP-A-0239400.
Cloning and expression of chimeric antibodies are
described in EP-A-0120694 and EP-A-0125023.

Hybridomas capable of producing antibody with desired
binding characteristics are within the scope of the
present invention, as are host cells, eukaryotic or
prokaryotic, containing nucleic acid encoding antibodies
(including antibody fragments) and capable of their
expression. The invention also provides methods of
production of the antibodies including growing a cell
capable of producing the antibody under conditions in
which the antibody is produced, and preferably secreted.

The reactivities of antibodies on a sample may be
determined by any appropriate means. Tagging with
individual reporter molecules is one possibility. The
reporter molecules may directly or indirectly generate
detectable, and preferably measurable, signals. The
linkage of reporter molecules may be directly or
indirectly, covalently, e.g. via a peptide bond, or
non-covalently. Linkage via a peptide bond may be as a
result of recombinant expression of a gene fusion
encoding antibody and reporter molecule. The mode of
determining binding is not a feature of the present
invention and those skilled in the art are able to choose
a suitable mode according to their preference and general
knowledge.

Antibodies may also be used in purifying and/or isolating
SET polypeptide, for instance following production of
the polypeptide by expression from encoding nucleic acid.
Antibodies may be useful in a therapeutic context (which
may include prophylaxis) to disrupt binding of a SET
polypeptide to histone H3 and inhibit the methylation of
the target lysine residue. Antibodies can for instance
be micro-injected into cells, e.g. at a tumour site,
subject to radio- and/or chemo-therapy (as discussed
already above). Antibodies may be employed in accordance
with the present invention for other therapeutic and
non-therapeutic purposes which are discussed
elsewhere herein.

As noted, the agent may be peptidyl, e.g. a peptide which
includes a sequence as recited above, or may be a 
functional analogue of such a peptide. 

As used herein, the expression "functional analogue"
relates to peptide variants or organic compounds having
the same functional activity as the peptide in question,
which may interfere with the methylation of the lysine 4
residue of histone H3 by a SET polypeptide. Examples of
such analogues include chemical compounds which are
modelled to resemble the three dimensional structure of
the SET methylase domain in the contact area, and in
particular the arrangement of the key amino acid
residues.

In a further aspect, the present invention provides the
use of a SET polypeptide, in particular a peptide
fragment, which is capable of methylating the lysine 4
residue of histone H3, in a method of designing a peptide
or non-peptidyl mimetic, which mimetic is able to
interact with the SET polypeptide active site and
modulate the methylation of the lysine 4 residue by the
SET polypeptide.

Accordingly, the present invention provides a method of
designing a mimetic, for example of a histone H3 amino
terminal fragment, which has the biological activity of
modulating the methylation of the lysine 4 residue by the
SET polypeptide, said method comprising:

(i) analysing a substance to determine the amino
acid residues essential and important for the biological
activity to define a pharmacophore; and,

(ii) modelling the pharmacophore to design and/or
screen candidate mimetics which modulate the methylation
as described.
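
Step (i) above might, for example, draw on substitution-scan data. The Python sketch below flags positions at which substitution reduces activity below a chosen fraction of the unmodified peptide's activity. The activity values, the 50% cutoff and the function name are hypothetical and serve only to illustrate the analysis; they are not results or parameters from this specification.

```python
def critical_positions(variant_activity, wild_type_activity, cutoff=0.5):
    """Return sorted positions whose substitution lowers activity markedly.

    variant_activity maps a 1-based residue position to the activity
    measured for the peptide substituted at that position. A position is
    flagged when that activity falls below cutoff * wild_type_activity.
    """
    if wild_type_activity <= 0:
        raise ValueError("wild-type activity must be positive")
    limit = cutoff * wild_type_activity
    return sorted(pos for pos, act in variant_activity.items() if act < limit)


# Hypothetical scan of a five-residue peptide (arbitrary activity units).
scan = {1: 95.0, 2: 40.0, 3: 88.0, 4: 5.0, 5: 70.0}
print(critical_positions(scan, 100.0))  # prints [2, 4]
```

The residues at the flagged positions would then be candidates for inclusion in the pharmacophore modelled in step (ii).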

Suitable modelling techniques are known in the art. This 
includes the study of the bonding between SET and histone 
H3 and the design of compounds which contain 
corresponding functional groups arranged in such a manner
that they could reproduce that bonding. 

The designing of mimetics to a known pharmaceutically
active compound is a known approach to the development of
pharmaceuticals based on a "lead" compound. This might
be desirable where the active compound is difficult or
expensive to synthesise or where it is unsuitable for a
particular method of administration, for instance SET
polypeptides may not be well suited as active agents for
oral compositions as they tend to be quickly degraded by
proteases in the alimentary canal.

There are several steps commonly taken in the design of a
mimetic from a compound having a given target property.
Firstly, the particular parts of the compound that are
critical and/or important in determining the target
property are determined. In the case of a peptide, this
can be done by systematically varying the amino acid
residues in the peptide, e.g. by substituting each
residue in turn. These parts or residues constituting
the active region of the compound are known as its
"pharmacophore".

Once the pharmacophore has been found, its structure is
modelled according to its physical properties, e.g.
stereochemistry, bonding, size and/or charge, using data
from a range of sources, e.g. spectroscopic techniques,
X-ray diffraction data and NMR. Computational analysis,
similarity mapping (which models the charge and/or volume
of a pharmacophore, rather than the bonding between
atoms) and other techniques can be used in this modelling
process.

In a variant of the above approach, the three-dimensional
structure of a ligand and its binding partner are
modelled. This can be especially useful where the ligand
and/or binding partner change conformation on binding,
allowing the model to take account of this in the design
of the mimetic.



A compound found to have the ability to affect SET
methylase activity has therapeutic and other potential in
a number of contexts, as discussed. For therapeutic
treatment such a compound may be used in combination with
any other active substance, e.g. for anti-tumour therapy
another anti-tumour compound or therapy, such as
radiotherapy or chemotherapy.

An agent identified using one or more primary screens
(e.g. in a cell-free system) as having ability to
modulate the methylase activity of a SET polypeptide may
be assessed further using one or more secondary screens.

A secondary screen may involve testing for a biological
function of histone H3 methylated at the lysine 4
position or a lysine residue selected from the group
consisting of lysine 27, lysine 36, lysine 37, lysine 42,
lysine 56, lysine 64, lysine 79, lysine 115, lysine 121,
lysine 122 and lysine 125, i.e. a core lysine residue of
histone H3 located outside the first 26 N terminal amino
acids, preferably lysine 36. Suitable biological
functions include the transcriptional activation of a
chromatin region or domain which comprises or is
associated with the histone H3 polypeptide and/or binding
to a regulatory factor such as the NuRD repressor.

Generally, a modulator according to the present invention
is provided in an isolated and/or purified form, i.e.
substantially pure. This may include being in a
composition where it represents at least about 90% active
ingredient, more preferably at least about 95%, more
preferably at least about 98%. Such a composition may,
however, include inert carrier materials or other
pharmaceutically and physiologically acceptable
excipients. As noted below, a composition according to
the present invention may include in addition to a
modulator compound as disclosed, one or more other
molecules of therapeutic use, such as an anti-tumour
agent.

The invention further provides a method of treatment which
includes administering to a patient an agent as described
herein which interferes with or inhibits the methylation
of the target residue of histone H3 (for example lys 4 or
a core lys residue) by a SET polypeptide. Exemplary
purposes of such treatment are discussed elsewhere herein.

The invention further provides various therapeutic methods
and uses of one or more compounds or substances selected
from (i) SET polypeptide which is able to bind to histone
H3; (ii) a modulator identified by a screening method of
the present invention; (iii) a mimetic of any of the above
substances which can bind to histone H3 or a SET
polypeptide.

The therapeutic/prophylactic purpose of such a method or
use may be the modulation, e.g. disruption or
interference, of the methylation of histone H3 at
residues other than lysine 9. Target residues include
lysine 4 for SET1. For a SET2 polypeptide, this may
include a lysine residue selected from the group
consisting of lysine 27, lysine 36, lysine 37, lysine 42,
lysine 56, lysine 64, lysine 79, lysine 115, lysine 121,
lysine 122 and lysine 125, i.e. a core lysine residue of
histone H3 located outside the first 26 N terminal amino
acids, for example, within the central and C terminal
regions, preferably lysine 36.



The therapeutic/prophylactic purpose may be:

(i) Cancer treatment, which may for example be in
combination with chemotherapy and/or radiotherapy.

(ii) Cancer prophylaxis.

(iii) Treatment of other proliferative disorders
described herein e.g. psoriasis, cataracts, multiple
myeloma.

In various further aspects, the present invention thus
provides a pharmaceutical composition, medicament, drug
or other composition for such a purpose, the composition
comprising one or more such compounds or substances which
modulate the activity of a SET polypeptide as described
herein and/or the methylation of histone H3 at a lysine
residue other than Lys9, the use of such a substance in a
method of medical treatment, a method comprising
administration of such a substance to a patient, e.g. for
treatment (which may include preventative treatment) of a
medical condition, e.g. a condition associated with a
defect or disorder in transcriptional control, DNA
replication, or cell cycle control, e.g. for treatment of
a disorder of cellular proliferation such as restenosis,
psoriasis, cataracts, multiple myeloma and cancer,
preferably all types of solid cancers and malignant
lymphomas and especially leukaemia, skin cancer, bladder
cancer, breast cancer, uterus cancer, ovary cancer,
prostate cancer, lung cancer, colon cancer, pancreas
cancer, renal cancer, stomach cancer and cerebral cancer,
use of such a substance in the manufacture of a
composition, medicament or drug for administration for
such a purpose, e.g. for treatment of a proliferative
disorder, and a method of making a pharmaceutical
composition comprising admixing such a substance with a
pharmaceutically acceptable excipient, vehicle or
carrier, and optionally other ingredients.

The substances may be used as sole active agents or in
combination with one another or with any other active
substance, e.g. for anti-tumour therapy another
anti-tumour compound or therapy, such as radiotherapy or
chemotherapy.

Whatever the substance used in a method of medical
treatment of the present invention, administration is
preferably in a "prophylactically effective amount" or a
"therapeutically effective amount" (as the case may be,
although prophylaxis may be considered therapy), this
being sufficient to show benefit to the individual. The
actual amount administered, and rate and time-course of
administration, will depend on the nature and severity of
what is being treated. Prescription of treatment, e.g.
decisions on dosage etc, is within the responsibility of
general practitioners and other medical doctors.

A substance or composition may be administered alone or 
in combination with other treatments, either 
simultaneously or sequentially dependent upon the 
condition to be treated, e.g. cancer.

Pharmaceutical compositions according to the present 
invention, and for use in accordance with the present 
invention, may include, in addition to active ingredient, 
a pharmaceutically acceptable excipient, carrier, buffer,
stabiliser or other materials well known to those skilled
in the art. Such materials should be non-toxic and
should not interfere with the efficacy of the active
ingredient. The precise nature of the carrier or other
material will depend on the route of administration,
which may be oral, or by injection, e.g. cutaneous, 
subcutaneous or intravenous. 

Pharmaceutical compositions for oral administration may
be in tablet, capsule, powder or liquid form. A tablet
may include a solid carrier such as gelatin or an
adjuvant. Liquid pharmaceutical compositions generally
include a liquid carrier such as water, petroleum, animal
or vegetable oils, mineral oil or synthetic oil.
Physiological saline solution, dextrose or other 
saccharide solution or glycols such as ethylene glycol, 
propylene glycol or polyethylene glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, or
injection at the site of affliction, the active
ingredient will be in the form of a parenterally
acceptable aqueous solution which is pyrogen-free and has
suitable pH, isotonicity and stability. Those of
relevant skill in the art are well able to prepare
suitable solutions using, for example, isotonic vehicles
such as Sodium Chloride Injection, Ringer's Injection,
Lactated Ringer's Injection. Preservatives, stabilisers,
buffers, antioxidants and/or other additives may be
included, as required.

Liposomes, particularly cationic liposomes, may be used 
in carrier formulations. 



Examples of techniques and protocols mentioned above can
be found in Remington's Pharmaceutical Sciences, 16th
edition, Osol, A. (ed), 1980.

The substance or composition may be administered in a
localised manner to a tumour site or other desired site
or may be delivered in a manner in which it targets 
tumour or other cells. 

Targeting therapies may be used to deliver the active
substance more specifically to certain types of cell, by
the use of targeting systems such as antibody or cell
specific ligands. Targeting may be desirable for a
variety of reasons, for example if the agent is
unacceptably toxic, or if it would otherwise require too
high a dosage, or if it would not otherwise be able to
enter the target cells.

Instead of administering such substances directly, they
may be produced in the target cells by expression from an
encoding nucleic acid introduced into the cells, e.g.
from a viral vector. The vector may be targeted to the
specific cells to be treated, or it may contain
regulatory elements which are switched on more or less
selectively by the target cells.

Nucleic acid encoding the substance e.g. a peptide able
to modulate, e.g. interfere with, the methylation of the
lysine 4 or core lysine residue of histone H3 by a SET
polypeptide, may thus be used in methods of gene therapy,
for instance in treatment of individuals, e.g. with the
aim of preventing or curing (wholly or partially) a
disorder.



Vectors such as viral vectors have been used in the prior
art to introduce nucleic acid into a wide variety of
different target cells. Typically the vectors are
exposed to the target cells so that transfection can take
place in a sufficient proportion of the cells to provide
a useful therapeutic or prophylactic effect from the
expression of the desired peptide. The transfected
nucleic acid may be permanently incorporated into the
genome of each of the targeted cells, providing long
lasting effect, or alternatively the treatment may have
to be repeated periodically.

A variety of vectors, both viral vectors and plasmid
vectors, are known in the art, see US Patent No.
5,252,479 and WO 93/07282. In particular, a number of
viruses have been used as gene transfer vectors,
including papovaviruses such as SV40, vaccinia virus,
herpesviruses, including HSV and EBV, and retroviruses.
Many gene therapy protocols in the prior art have used
disabled murine retroviruses.

As an alternative to the use of viral vectors in gene
therapy, other known methods of introducing nucleic acid
into cells include mechanical techniques such as
microinjection, transfer mediated by liposomes and
receptor-mediated DNA transfer.

Receptor-mediated gene transfer, in which the nucleic
acid is linked to a protein ligand via polylysine, with
the ligand being specific for a receptor present on the
surface of the target cells, is an example of a technique
for specifically targeting nucleic acid to particular
cells.



A peptide or other substance having an ability to
modulate or interfere with the methylation of the lysine
4 or core lysine residue of histone H3 by a SET
polypeptide, or a nucleic acid molecule which encodes a
peptide having that ability, may be provided in a kit,
e.g. sealed in a suitable container which protects its
contents from the external environment. Such a kit may
include instructions for use.

In still further aspects the present invention provides
for the purification of a SET polypeptide, or for the
purification of a histone H3 polypeptide. The invention
also provides for a purified SET polypeptide protein.

The purified polypeptide may be about 10% pure, more
preferably about 20% pure, more preferably about 30%
pure, more preferably about 40% pure, more preferably
about 50% pure, more preferably about 60% pure, more
preferably about 70% pure, more preferably about 80%
pure, more preferably about 90% pure, more preferably
about 95% pure, or substantially pure.

In another aspect the present invention provides a method 
of purifying a SET polypeptide, the method including 
contacting the SET polypeptide with histone H3
polypeptide. 

A mixture of material including SET polypeptide may be 
contacted against immobilised histone H3 polypeptide 
(e.g. immobilised either covalently or non-covalently
such as via a specific binding molecule such as 
streptavidin or biotin) and molecules which do not bind 
to the histone H3 polypeptide are washed off. 

30 Following purification, the SET polypeptide may be used 
as desired, e.g. in an assay for an agent which modulates 
its activity, e.g. binding, in raising or obtaining a 
specific antibody or other binding molecule, or in a 
therapeutic context. 
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Other aspects of the present invention relate to the 
disruption of the interaction between histone H3 and the 
jNPuRD con5)lex by the methylation of histone H3 lysine 4 by 
5 a SBTl polypeptide. 

A method for identifying and/or obtaining an agent which
modulates the interaction of histone H3 with a NuRD
repressor may include:
(a) bringing into contact one or more components of the
NuRD repressor complex and a histone H3 polypeptide in
the presence of a test compound; and,
(b) determining interaction between the histone H3
polypeptide and the said one or more components.

The one or more components of the NuRD repressor complex
and the histone H3 polypeptide may be brought together in
the presence of a SET1 polypeptide. A reduction in the
activity of the SET1 polypeptide in methylating the lys 4
residue of the histone H3 polypeptide may be determined
by an increase in interaction and/or binding between the
NuRD complex and the histone H3 polypeptide in the
presence relative to the absence of the test compound.

Components of the NuRD complex may include HDAC1, HDAC2,
Mi-2β, Rbap48, Rbap46, MTA1, MTA2 and MBD3 (Zhang et al
(1998) Cell 95(2) 279-289). Histone H3 polypeptides are
described above.

Interaction and/or binding may be determined by any
convenient technique as described above. An agent
identified and/or obtained using such a method may be
useful in the modulation of transcriptional activation.

Other aspects of the present invention relate to the
tri-methylation of histone H3 lysine 4 by a SET1 polypeptide
and the effect of lys 4 tri-methylation on
transcriptional activation.

A method of identifying transcriptionally active
chromatin may comprise:

determining the presence or absence of a
trimethylated Lys 4 residue in a histone H3 polypeptide
of said chromatin.

The presence of a trimethylated Lys 4 residue in a histone
H3 polypeptide of said chromatin is indicative that said
chromatin is transcriptionally active.

The presence or absence of a trimethylated Lys 4 may be
determined by any convenient means. For example, said
chromatin region may be contacted with an antibody which
binds specifically to a histone H3 polypeptide which is
trimethylated at Lys 4.

Various further aspects and embodiments of the present
invention will be apparent to those skilled in the art in
view of the present disclosure.

Certain aspects and embodiments of the invention will now
be illustrated by way of example and with reference to
the figures described below.

Figure 1 shows sequence alignments within the SET domains
and flanking sequences of yeast proteins containing SET
domains, human homologues of SET1 and the human histone H3
lysine 9 methylase Suv39h1. hSet1.1, human KIAA0339;
hSet1.2, human KIAA1076; sp Set1, S. pombe SPCC306.04C; sc
Set1, S. cerevisiae Set1p; sc Set2, S. cerevisiae Set2p;
SUV39H1, human Suvar 39H1; sc Set3, S. cerevisiae YJL105W;
sc Set4, S. cerevisiae YKR029C; sc Set5, S. cerevisiae
YHR207C; sc Set6, S. cerevisiae YPL165C. Alignments were
calculated as follows: Clustal W1.8 (DNA-Protein)-Global
progressive (BCM) > BOXSHADE 3.21, Consensus line: No
consensus. Fraction of sequences (that must agree for
shading): 0.5.

Figure 2 shows sequence alignments within the SET domains
and flanking sequences of SET1 and SET2 polypeptides.
hSet1.1, human KIAA0339; hSet1.2, human KIAA1076; sc
Set1, S. cerevisiae Set1p; sp Set2, S. pombe SPAC29B12; sc
Set2, S. cerevisiae Set2p; hSet2.1, human KIAA1732;
hSet2.2, human HSPC069; hSet2.3, human AJ238403; WHSC1,
human WHSC1.

Figure 3 shows the mapping of the methylation site of the
S. cerevisiae SET1 in the histone H3 amino tail.

Figure 4 shows a relationship tree between proteins which
contain SET domains. The tree was compiled using the
following multiple alignment parameters: program -
ClustalW (v1.4), similarity matrix - blosum, Open Gap
penalty - 10.0, Extend Gap penalty - 0.1, Delay Divergent
- 40%, Gap distance - 8.

Figure 5 shows a schematic diagram of various methylated
and unmethylated histone H3 peptides used to purify a
specific set of binding proteins. The C-terminus of each
of the peptides was followed by GGC.

Figure 6 shows the deacetylase activity (released
tritiated acetate, as measured by scintillation counting;
cpm = counts per minute) from proteins bound to the
unmethylated and methylated histone H3 peptide in pull
down assays.

Figure 7 shows a list of genes activated by SET1. To
identify targets of SET1 mediated expression,
transcription profiles of set1Δ yeast were determined.
Potential targets were collated from genes down regulated
in the mutant and assigned relative merit (P-value) using
a gene-specific error model. Secondary effects were
eliminated by disregarding genes also down regulated in
general stress or in a sir2Δ profile (sir2Δ and sir1Δ
yeast both exhibit downstream effects secondary to
derepression of the silent mating loci).

Table 1 shows % identity within the SET Domain among the
SET1 and SET2 Families. The table was compiled using the
following parameters: program - MacVector 6.5, matrix -
BLOSUM 20, Clustal alignment - pairwise alignment, Open
Gap penalty - 1.0, Extend Gap penalty - 0.1.
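
By way of illustration only, a pairwise % identity of the
kind tabulated in Table 1 could be computed from two
already-aligned sequences as sketched below. This is not
the MacVector calculation itself: the function name
percent_identity, the choice to ignore gapped columns in
the denominator, and the toy sequences are all assumptions
made for this sketch.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are skipped (an assumption of this sketch)
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


# Hypothetical toy alignment, not taken from Table 1 or the figures.
toy_a = "GWGV-RAL"
toy_b = "GWGLKRA-"
example_identity = percent_identity(toy_a, toy_b)  # 5 identical of 6 compared columns
```

Other conventions (for example dividing by the length of
the shorter sequence) give different values, so the
convention used should be stated alongside any figure.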

Table 2 shows a summary of the K4 methylation state at
the promoter of constitutive genes, inositol regulated
genes and methionine regulated genes. "-" means no
methylation detected. "Di" means di-methylation detected
at the promoter and "tri" means tri-methylation at the
promoter. The growth conditions for activation and
repression of the genes are described below.

EXPERIMENTAL 
Materials and Methods 
Purification of yeast SET Protein 

Yeast Set1p and Set6p were tagged at the N-terminus with
a Protein A epitope from S. aureus. These proteins were
expressed in the wild type strain RS453 (MATa, ade2,
ura3, leu2, his3, trp1) and in the C13-ABYS-86 strain
(Mata ura3 leu2-3 his3-112 pra1-1 prb1-1 prc1-1 cps1-3).
The tagged proteins with the expected size (PtA-Set1p,
160 kDa; PtA-Set6p, 55 kDa) were detected in total extract
by western blot with anti-Protein A antibody (PAP,
rabbit, DAKO Z0113) at 1:3000 dilution. PtA-Set1p and
PtA-Set6p were purified by affinity to an IgG Sepharose
column as described (Siniossoglou, S. et al (1996) Cell
84, 265-275). For each set of assays, 5 g of spheroplasts
were homogenised in 50 ml of lysis buffer. The soluble
supernatant (S) was passed through a column containing
200 µl of IgG beads.

Avoiding the elution step, the beads were washed with 25
ml of equilibration buffer. Then, 60 µl of beads were
transferred to each of 3 eppendorf tubes, placed on ice,
for the enzymatic reaction. The remaining 20 µl of beads
were re-suspended in 50 µl Laemmli buffer, vortexed and
boiled for 3 min. Samples of "soluble supernatant, S"
(30 µl, equivalent to 0.06% of the total soluble
supernatant) and "eluted protein, E" (25 µl, equivalent
to 5% of the total purified protein) were analysed by
western blot with anti-Protein A antibody (PAP, rabbit,
DAKO Z0113) at 1:3000 dilution.

Equilibration Buffer: 20mM Tris-HCl pH 8.5; 0.05 mM DTT. 
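
The loading fractions quoted for the western blot samples
(0.06% of the supernatant and 5% of the purified protein)
follow from the volumes given above, as the short sketch
below illustrates. It assumes that the total soluble
supernatant is approximately the 50 ml of lysis buffer and
that the boiled bead sample has a volume of about 50 µl;
the function and variable names are hypothetical.

```python
def percent_of_total(loaded: float, total: float) -> float:
    """Express a loaded amount as a percentage of the total (same units)."""
    return 100.0 * loaded / total


# Soluble supernatant: 30 µl loaded out of roughly 50 ml (50,000 µl).
supernatant_percent = percent_of_total(30.0, 50_000.0)

# Eluted protein: 20 µl of the 200 µl of beads were boiled in 50 µl of
# Laemmli buffer, and 25 µl of that sample were loaded.
bead_fraction = 20.0 / 200.0      # share of the purified protein that was boiled
loaded_fraction = 25.0 / 50.0     # share of the boiled sample that was loaded
eluted_percent = 100.0 * bead_fraction * loaded_fraction

print(f"supernatant loaded: {supernatant_percent:.2f}% of total")
print(f"eluted protein loaded: {eluted_percent:.1f}% of total")
```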



Methyltransferase Assay

The enzymatic assays were performed on a mixture of
soluble histones from calf thymus (H2A, H2B, H3 and H4),
Sigma; recombinant Xenopus H3 or H3 deleted for the first
26 amino acids (Δtail H3).

For each reaction, 60 µg of calf histones or 20 µg of
recombinant H3 were used. The following mixtures were
prepared:

1) 60 µl of PtA-Setp beads (either Set1p, Set2p or
Set6p) + 30 µl of methylation buffer + 2 µl of 14C-SAM
(NEC-363 adenosyl-L-methionine, S-(methyl-14C)).

2) 60 µl of PtA-Setp beads (either Set1p, Set2p or
Set6p) + 30 µl of methylation buffer + 3 µl of soluble
histones (60 µg) + 2 µl of 14C-SAM (NEC-363
adenosyl-L-methionine, S-(methyl-14C)).

3) 60 µl of PtA-Setp beads (either Set1p, Set2p or
Set6p) + 30 µl of methylation buffer + 4 µl of recombinant
xH3 (20 µg) + 2 µl of 14C-SAM (NEC-363
adenosyl-L-methionine, S-(methyl-14C)).

4) 60 µl of IgG sepharose beads (previously equilibrated
in methylation buffer) + 30 µl of methylation buffer + 3
µl of soluble histones (60 µg) + 2 µl of 14C-SAM (NEC-363
adenosyl-L-methionine, S-(methyl-14C)).

5) 60 µl of IgG sepharose beads (previously equilibrated
in methylation buffer) + 30 µl of methylation buffer +
4 µl of recombinant xH3 (20 µg) + 2 µl of 14C-SAM (NEC-363
adenosyl-L-methionine, S-(methyl-14C)).

Methylation Buffer: 50 mM Tris-HCl pH 8.5; 20 mM KCl, 10 mM
MgCl2, 10 mM β-mercaptoethanol, 0.05 mM DTT, 250 mM sucrose,
0.2% dodecyl-β-D-maltoside.

When assayed, the recombinant Xenopus H3 deleted for the
first 26 amino acids (xΔH3) was used at 10 µg/reaction.

The microfuge tubes were incubated at 30°C for 2 hours
with shaking. After that, 15 µl of 4x protein sample
buffer were added to each reaction (the total liquid
volume of each reaction should be around 45 µl). The samples
were boiled for 3 minutes. The histones were separated in
a 1.5 mm, 20% acrylamide gel, run at 200 V for 1 hour
and 15 minutes and transferred to nitrocellulose
membranes by standard procedures. The nitrocellulose
membrane was exposed to Kodak Biomax MS film with a Biomax
TranScreen-LE intensifying screen at -80°C for 36-48 h.
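
The choice of 15 µl of 4x sample buffer for a reaction of
about 45 µl can be checked with a simple dilution
calculation, sketched below. The sketch assumes ideal
mixing of volumes; the function name stock_volume_for_1x
is hypothetical and is used here for illustration only.

```python
def stock_volume_for_1x(sample_volume_ul: float, stock_fold: float) -> float:
    """Volume of an N-fold stock to add so that the final mixture is 1x.

    Solves stock_fold * v = 1 * (sample_volume_ul + v) for v.
    """
    if stock_fold <= 1:
        raise ValueError("stock must be more concentrated than 1x")
    return sample_volume_ul / (stock_fold - 1)


buffer_ul = stock_volume_for_1x(45.0, 4.0)            # 4x stock into about 45 µl
final_fold = 4.0 * buffer_ul / (45.0 + buffer_ul)     # concentration after mixing

print(f"add {buffer_ul:.0f} µl of 4x buffer, final strength {final_fold:.1f}x")
```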

Mapping of Site of Methylation within Histone H3
Histone H3 was methylated by SET1 as described herein
with a 14C-methyl group. The methylated residue was then
identified using N-terminal sequence analysis as
described in Reim D. and Speicher D. Current Protocols in
Protein Science (1997) 11.10.01-11.10.38.

Briefly, a stained protein band was excised from a PVDF
membrane, sonicated in a bath sonicator and sequenced
using a Hewlett-Packard Model G1005A sequencing system.
Sequencing was carried out according to the
manufacturer's instructions and PTH (phenylthiohydantoin)
amino acids from each sequencer cycle were separated by
in-line HPLC to provide sequence data.

Expression Profiling
The set1Δ mutant and its parent strain UCC1001 were grown
in parallel in YPD media at 30°C to an OD600 of 1.0. Poly
(A) RNA was isolated and reverse transcribed
incorporating amino-allyl dUTP. The resulting DNA probe
was labeled with reactive Cy5 (mutant) or Cy3 (parent
strain) dye and hybridized to a spotted cDNA microarray
containing the yeast open reading frames as described in
Carroll et al (2001) PNAS 98 12578-12583 and DeRisi et al
(1997) Science 278 680-686. Microarrays were analyzed
using a GenePix 4000A scanner and GENEPIX 3.0 software.
Candidate SET1 regulated genes were identified from
duplicate experiments using a gene specific error model
described previously (Hughes et al Cell (2000) 102 109-
126).
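
As a rough illustration of how a two-colour array of this
kind is read out, the sketch below converts Cy5 (mutant)
and Cy3 (parent) spot intensities into a log2 ratio and a
fold reduction, averaging duplicate spots in log space. It
does not implement the gene specific error model of Hughes
et al; the intensity values and all function names are
hypothetical.

```python
import math


def log2_ratio(cy5_mutant: float, cy3_parent: float) -> float:
    """log2(mutant / parent); negative values mean lower expression in the mutant."""
    return math.log2(cy5_mutant / cy3_parent)


def mean_log2_ratio(spots) -> float:
    """Average the log2 ratios of replicate (Cy5, Cy3) intensity pairs."""
    ratios = [log2_ratio(cy5, cy3) for cy5, cy3 in spots]
    return sum(ratios) / len(ratios)


# Hypothetical background-corrected intensities for one gene in duplicate experiments.
duplicate_spots = [(500.0, 1000.0), (450.0, 900.0)]

mean_ratio = mean_log2_ratio(duplicate_spots)
fold_reduction = 2.0 ** (-mean_ratio)   # how many fold lower in the mutant

print(f"mean log2 ratio {mean_ratio:.2f}, fold reduction {fold_reduction:.2f}")
```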

Northern blots and ChIPs
Northern blots and ChIPs were done as described in Kent
et al Genes Dev (2001) 15 619-626, except that for ChIP
5 µl of Tri-me K4 H3 antibody and 2 µl of Di-me K4 H3
antibody were used and protein A-sepharose beads were
pre-incubated with 1.5 µg of sonicated salmon sperm DNA
for 30 minutes. For the ChIP experiments, the cells were
grown overnight in minimal medium lacking inositol (medium
-ino), then diluted to OD600: 0.15 in minimal medium
-inositol or the same one supplemented with 100 mg/L
inositol (medium +ino) and grown to OD600: 1.2 and then
processed for chromatin preparation.
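
For orientation, growth from OD600 0.15 to OD600 1.2
corresponds to about three population doublings, as the
sketch below shows. It assumes that OD600 is proportional
to cell density over this range; the function name
doublings is hypothetical.

```python
import math


def doublings(od_start: float, od_end: float) -> float:
    """Number of population doublings between two optical densities."""
    return math.log2(od_end / od_start)


n_doublings = doublings(0.15, 1.2)   # an 8-fold increase in OD600
print(f"approximately {n_doublings:.1f} doublings")
```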

Samples from medium -methionine and +methionine were
prepared in the same way except that the medium was
supplemented with methionine rather than inositol.



Affinity purification from HeLa nuclear extract
Histone peptides which were trimethylated at different
lysine residues were made using standard techniques.
The C-terminus of the peptides contained a 2 glycine
spacer followed by a cysteine. The peptides were
immobilised onto Sulfolink Coupling gel (Pierce) via the
C-terminal cysteine, at a concentration of 1 mg/ml.

HeLa nuclear extract (Computer Cell Culture Centre,
Belgium) was diluted in IPH buffer (50 mM Tris pH 8, 150 mM
NaCl, 5 mM EDTA, 0.5% NP-40 v/v) to a final protein
concentration of 4 mg/ml and pre-cleared.

Affinity purifications were done with 5 µl of sepharose-
linked peptide and 150 µl of diluted HeLa nuclear extract.
Competitor peptide was added to a final concentration of
150 µg/ml, where used. The purifications were incubated
for 90 minutes on a wheel at 4°C and washed 3 times in
IPH before resolution on an 8% SDS-PAGE gel. For column
purification of complex, 2 ml of diluted HeLa nuclear
extract was poured twice over a micro-column of 100 µl of
sepharose-linked unmethylated H3 peptide. The column was
washed with 4 column volumes of IPH and bound proteins
were eluted in 250 mM NaCl IPH. Eluted fractions
containing the complex were made up to 150 µl in IPH and
rebound to 5 µl of H3 peptide sepharose, as above. Bound
proteins were visualised by silver staining as described
previously (Morrissey (1981) 117(2) 307-310).

Mass spectrometry identification of proteins

Proteins that rebound un-methylated peptide after column
purification were excised from an 8% Coomassie stained
gel with a protein-free razor blade. Proteins were
digested in-gel with trypsin. After overnight incubation
at 30°C, a 0.5 µl aliquot from the digest was spotted on
the MALDI target for fingerprinting. The remaining digest
was purified using C18 Zip-Tips prior to ESI-MS.

Peptide Mass Fingerprints were taken with a MALDI
instrument from Micromass fitted with Delayed Extraction,
using alpha-cyano-cinnamic acid dissolved in 50%
Acetonitrile: 0.1% TFA as matrix. CID fragmentation
spectra from peptides were taken with an Ion-trap
instrument (LCQ-Deca) from ThermoQuest. Protein databases
were searched with the mass spectrometry data using the
programs Mascot, http://matrixscience.com (mass
fingerprinting and fragmentation spectra) and
Profound, http://129.85.19.192 (mass fingerprinting data).

Deacetylase assays

Histone peptide affinity purifications from HeLa nuclear
extract were re-suspended in 100 µl of fresh IPH buffer
containing 150,000 cpm of [3H]-labelled acetylated H4
peptide. Reactions were incubated at 37°C for 120
minutes, with regular mixing. Reactions were stopped by
the addition of 65 µl of acid mix (1 M HCl, 0.16 M acetic
acid). 700 µl of ethylacetate was added and tubes were
vortexed vigorously for 20 seconds. Phases were separated
by centrifugation at 13500 rpm for 1 min. 500 µl of the
upper phase (ethylacetate) was removed and mixed with 1 ml
of scintillant (Optiphase, Wallac). Release of [3H]-
acetate from the H4 peptide was measured in counts per
minute (cpm) using a scintillation counter (Beckman
LS6000SC).
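
Because only 500 µl of the 700 µl ethylacetate phase is
counted, the measured cpm can be scaled to the whole
organic phase and expressed as a fraction of the 150,000
cpm of input peptide. The sketch below shows one way of
doing so. It assumes that all released acetate partitions
into the ethylacetate phase and that the phase volume
remains 700 µl; the function name, the optional background
correction and the example reading of 5000 cpm are
hypothetical.

```python
def released_fraction(counted_cpm: float,
                      counted_ul: float = 500.0,
                      organic_ul: float = 700.0,
                      input_cpm: float = 150_000.0,
                      background_cpm: float = 0.0) -> float:
    """Fraction of input label recovered as released acetate."""
    net_cpm = counted_cpm - background_cpm
    total_released_cpm = net_cpm * organic_ul / counted_ul  # scale to whole phase
    return total_released_cpm / input_cpm


example_fraction = released_fraction(5000.0)   # hypothetical reading of 5000 cpm
print(f"{100 * example_fraction:.1f}% of input label released")
```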

Western blots, antibodies and immunoprecipitations
Bound proteins were resolved on 8% SDS-PAGE gel, blotted
to nitrocellulose and blocked overnight (4% non-fat
milk/0.5% Tween 20 v/v). Blots were probed with the
following antibodies for 1 hour at room temperature:
anti-Rbap48 (Genetex, 11G10, 1 µg/ml), anti-DNMT1 (NEB-
231, 1 in 1000), anti-Sin3 (Santa Cruz 994, 1 µg/ml) and
anti-p60 (1 in 1000). Blots were washed in blocking
buffer, incubated with HRP linked secondary antibodies
(Abcam) and visualised with ECL (Amersham). NuRD
immunoprecipitations were done by incubating 500 µl of
diluted HeLa nuclear extract with anti-MTA2 antibody
(Santa Cruz 9447: 1 µg/ml) and 15 µl of protein A/G bead
mix (Amersham) for 5-12 hours at 4°C.

Precipitations were washed 3 times with IPH before
resolution on a 20% SDS-PAGE gel. After blotting to
nitrocellulose and blocking (5% BSA/0.5% Tween 20) for 1
hour at room temperature, the blots were probed overnight
with anti-MBD3 (Santa Cruz 9402, 1 µg/ml), anti-H3
(500 ng/ml) or anti-H4 (Abcam ab7311, 500 ng/ml).

Blots were washed briefly with BSA blocking buffer,
incubated with HRP linked secondary antibody and
visualised as above.

Results

S. cerevisiae SET1 and SET2 are Histone
Methyltransferases (HMTs)
PtA tagged SET1p, SET2p and SET6p were purified from S.
cerevisiae by IgG sepharose chromatography. Western
blotting revealed a strong band in each of the eluted
protein fractions corresponding to full-length PtA-SET1p,
PtA-SET2p and PtA-SET6p, respectively.

Approximately 10 µg of purified PtA-SET1p, PtA-SET2p or
PtA-SET6p were assayed for methyltransferase activity on
60 µg of a mixture of calf thymus soluble histones (H2A,
H2B, H3 and H4) or 20 µg of recombinant Xenopus histone H3
in the presence of (14C-Me) S-adenosyl methionine.

Autoradiograms were developed after 48 hours exposure.

No methylation was observed for PtA-SET1p, PtA-SET2p or
PtA-SET6p in the absence of calf or recombinant histones.

PtA-SET1p and PtA-SET2p were observed to methylate
recombinant Xenopus histone H3. When the preparation of
histones H2A, H2B, H3 and H4 was used, PtA-SET1p and
PtA-SET2p were observed to specifically methylate histone
H3.

No methylation activity was observed for PtA-SET6p in the
presence of histones H2A, H2B, H3 and H4 or the presence
of recombinant Xenopus histone H3.

The Methyltransferase Activity of SET1 is Specific for
Lysine 4

A truncated recombinant Xenopus H3 histone lacking the
first 26 N-terminal residues was compared
with the full length Xenopus H3 histone as a substrate
for PtA-SET1p and PtA-SET2p.

For each assay, 20 µg of recombinant Xenopus H3 histone
and 10 µg of PtA-SET1p were incubated in the presence of
(14C-Me) S-adenosyl methionine. Autoradiograms showing the
presence of 14C label were developed after 48 hours. The
presence of substrates on the gel was confirmed by
Ponceau staining.

No methylation was observed in the absence of a histone
H3 substrate.

Both PtA-SET1p and PtA-SET2p were observed to methylate
full length Xenopus histone H3.

No methylation by PtA-SET1p of the Xenopus histone H3
with the N-terminal deletion was observed, but PtA-SET2p
was observed to methylate this truncated protein.

These results demonstrate that PtA-SET1p methylates
histone H3 within the N-terminal 26 amino acid residues
of histone H3, whereas PtA-SET2p methylates outside this
amino tail region, within the core of the histone H3
polypeptide.

Mapping of the Methylation Site within the H3 Amino Tail
Xenopus H3 histone labelled with a 14C-methyl group by
PtA-SET1p as described above was sequenced by sequential
Edman degradation and fractions corresponding to each
amino acid cycle were collected and counted by
scintillation counting. The results are shown in Figure
3. The N-terminal sequence of Xenopus histone H3 is shown
underneath the corresponding fraction.

These results show that the lysine 4 residue of histone
H3 is specifically methylated by PtA-SET1p. No
methylation of lysine 9 is observed.
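
The way in which radioactivity per Edman cycle identifies
the labelled residue can be sketched as follows. Each
sequencer cycle releases the next residue from the
N-terminus, so the cycle with the highest count points to
the methylated position. The per-cycle counts below are
hypothetical and are not the data of Figure 3; the
function name labelled_residue is likewise an assumption
of this sketch.

```python
# First 20 residues of the histone H3 amino tail (initiator Met removed).
H3_N_TERMINUS = "ARTKQTARKSTGGKAPRKQL"


def labelled_residue(cpm_per_cycle, sequence=H3_N_TERMINUS):
    """Return (residue, position) for the Edman cycle with the highest count."""
    if len(cpm_per_cycle) > len(sequence):
        raise ValueError("more cycles than residues in the reference sequence")
    peak_index = max(range(len(cpm_per_cycle)), key=cpm_per_cycle.__getitem__)
    return sequence[peak_index], peak_index + 1   # positions are 1-based


# Hypothetical counts for cycles 1 to 10; the shoulder at cycle 5 mimics carry-over.
example_counts = [40, 35, 38, 900, 120, 60, 45, 42, 50, 41]
residue, position = labelled_residue(example_counts)

print(f"peak at cycle {position}: residue {residue}")
```

In this toy data set the peak falls at cycle 4 (a lysine)
and cycle 9 (also a lysine) stays at background, which is
the pattern described in the text for PtA-SET1p.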

20 

SET polypeptides as described herein are shown to
specifically methylate histone H3 at residues other than
lysine 9.

SET1 polypeptides methylate the lysine 4 residue while
SET2 polypeptides methylate the core of histone H3,
outside the amino tail.

The histone H3 N-terminus binds a set of proteins when
unmethylated at lysine 4

Differently methylated histone peptides were used in pull
down assays from HeLa nuclear extract and the bound
proteins analysed by silver stain.

The histone peptides used were trimethylated at lysine 4
(K4), lysine 9 (K9) or both (K4+K9), together with two
control peptides of unmethylated H3 and H4 tails (Figure 5).
These peptides were immobilised onto sepharose beads via
a C-terminal cysteine residue and used to affinity purify
proteins from HeLa nuclear extract.

Although some proteins were found to specifically bind to
the methylated peptides, a number of proteins were also
observed to be specifically purified by the unmethylated
H3 peptide.

This set of proteins was also observed to be purified by
the K9 methylated H3 peptide, but not by a peptide
methylated at K4, or K4 and K9, or by an unmethylated H4
peptide. This indicates that these proteins only bind to
histone H3 N-termini when lysine 4 is unmethylated and
that methylation of lysine 9 does not disrupt the
binding of the complex.
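
The inference drawn from the four H3 peptides can be set
out explicitly, as in the sketch below. The binding
outcomes are those stated in the text for the H3 peptides;
the data structure and the function name
mark_blocks_binding are hypothetical and serve only to
illustrate the reasoning.

```python
# Pull down outcomes for the H3 peptides, as described in the text.
pull_downs = [
    {"peptide": "H3 unmethylated", "K4me3": False, "K9me3": False, "bound": True},
    {"peptide": "H3 K4me3",        "K4me3": True,  "K9me3": False, "bound": False},
    {"peptide": "H3 K9me3",        "K4me3": False, "K9me3": True,  "bound": True},
    {"peptide": "H3 K4me3+K9me3",  "K4me3": True,  "K9me3": True,  "bound": False},
]


def mark_blocks_binding(records, mark: str) -> bool:
    """True if binding is seen on exactly those peptides that lack the given mark."""
    return all(record["bound"] == (not record[mark]) for record in records)


k4_explains_binding = mark_blocks_binding(pull_downs, "K4me3")
k9_explains_binding = mark_blocks_binding(pull_downs, "K9me3")

print("K4 methylation accounts for loss of binding:", k4_explains_binding)
print("K9 methylation accounts for loss of binding:", k9_explains_binding)
```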

To confirm the specificity of the proteins binding to
unmethylated H3 peptide, pull downs were done in the
presence or absence of competitor peptide. The set of
proteins bound to the unmethylated peptide was observed
to be competed away by peptides unmethylated at K4, but
not by peptides with a methylated K4.

The set of proteins bound to unmethylated H3 tails is the
NuRD complex

The set of proteins, putatively components of a binding
complex, identified above were purified from HeLa nuclear
extract over a column of unmethylated H3 peptide. The
column-purified proteins, once eluted in high salt, were
rebound to unmethylated peptide in a pull down assay. The
rebound proteins were resolved by SDS-PAGE and silver
stained. This purified source of the putative complex was
used for mass spectrometry identification of the
components.

The sub-units of the complex identified in this way
include the histone deacetylases HDAC1/2, the ATPase
chromatin-remodelling enzyme Mi-2β, the RB associated
proteins Rbap48/46, the metastasis-associated antigens
and the methyl CpG binding domain protein MBD3. These are
all known components of the NuRD complex (Zhang et al
(1998) Cell 95(2) 279-289, Zhang et al (1999) Genes Dev
13(15) 1924-1935, Xue et al (1998) Mol Cell 2(6) 851-
861).

Several components of the NuRD complex have been
identified in other deacetylase complexes (Knoepfler, P.
S. et al (1999) Cell 99, 447-450). The purified complex
was therefore probed with antibodies against other
deacetylase complex components. Rbap48 protein was
detected, whereas DNMT1 and Sin3 (present in distinct
deacetylase complexes: Robertson, K. D. et al (2000) Nat
Genet 25(3), 338-42, Zhang, Y. et al (1997) Cell 89(3),
357-64) were absent from the H3 purified complex. Rbap48
is also present in the CAF-1 (chromatin assembly factor 1)
complex (Ridgway, P. et al (2000) J Cell Sci 113 (Pt
15), 2647-58). However, a western blot for the p60
subunit of CAF-1 did not detect CAF-1 in the H3 purified
complex.



To establish if NuRD has deacetylase activity when
affinity purified on unmethylated H3 peptide, peptide
pull downs were assayed for deacetylase activity.
Unmethylated H3 peptide was found to associate with a
significant level of histone deacetylase (HDAC) activity
(Figure 6). In contrast, a peptide methylated at K4
pulled down background levels of HDAC activity. This
indicates that the deacetylase activity of the NuRD
complex is not compromised when it is associated with the
unmethylated histone H3 tail.

NuRD complex associates with native histone H3 only when
unmethylated at lysine 4

Previous studies have shown that the NuRD complex can be
immunoprecipitated using anti-MTA2 antibodies (Zhang et
al (1999) supra). MTA2 antibodies were observed to
precipitate MBD3 (a component of NuRD) as well as
histone H3 and H4, as detected by western blot with the
respective antibodies.

The immunoprecipitated histones were found to be
efficiently competed away by the unmethylated
peptide, but not by the K4 methylated peptide. These data
indicate that the NuRD complex associates with
unmethylated native histone H3 (and interacts
indirectly with H4 via H3).

Methylation Status of K4 of Histone H3

The ε-amino group of lysine residues may be mono-, di- or
tri-methylated (me). Antibodies were raised to
specifically recognize either the di-me or tri-me state
of K4 H3. An antibody raised against di-me K4 was found
to recognize H3 from a purified source of histones and to be
effectively competed by a di-me but not a tri-me K4 H3
peptide. An antibody raised against tri-me K4 was found
to be specifically competed by the tri-me but not di-me
K4 H3 peptide. Both antibodies were found to selectively
recognise histone H3 in a total yeast extract. Neither
antibody recognised H3 from a yeast strain containing a
K4 H3 mutation to alanine (K4A) and neither antibody
recognised bacterially produced recombinant, unmethylated
H3.

Both Di- and Tri-me K4 antibodies were found to recognise
histone H3 after it has been methylated by SET1.
Deletion of SET1 abolishes recognition of histone H3 by
the Di- and Tri-me K4 antibodies, whereas deletion of
other S. cerevisiae SET genes (SET2, 3, 4, 5 and 6) has
no effect.

To establish if the Di- or Tri-me state is linked to gene
activity, we investigated whether SET1 can act as an
activator of transcription and identified genes
positively regulated by SET1.

Transcriptional profiling of yeast deficient in SET1 was
therefore carried out and 200 genes were identified whose
activity is reduced in the absence of SET1. Figure 7
shows a list of the top twenty genes whose transcription
is reduced between 1.6 and 2.1 fold in the set1Δ strain
relative to WT yeast. Northern blot analysis of one of
these genes, PPH3, confirms that the mRNA expression of
this gene is indeed reduced in the set1Δ strain. This
indicates that SET1 positively regulates transcription.

We next established whether the SET1 activated genes were
methylated at K4 H3 and identified the methylation state
of K4. The PPH3 gene, as well as two of the other SET1-
dependent genes, KAMI and NUP 110, were investigated. The
use of Di-me and Tri-me specific antibodies in chromatin
immunoprecipitation (ChIP) showed that all three genes
have Di- and Tri-me K4 H3 associated with their promoter.
Since the expression profiling was done from yeast grown
in rich medium, all three of these genes, and indeed all
the SET1-regulated genes in Figure 7, are constitutively
active.

The K4 H3 methylation status of a set of genes whose
activity can be regulated was then investigated using
genes that can be activated by either inositol or
methionine deprivation. When the INO1 gene is repressed,
K4 H3 was found to be Di-me but not Tri-me. However, when
the gene is active, K4 H3 Di-me was still observed but
Tri-me also became apparent. Two other inositol regulated
genes, INO2 and INO4, also showed the presence of Tri-me K4
H3 only when active.

Analysis of an independent metabolic pathway, where genes
are actively transcribed in the absence of methionine,
showed that the MET16 gene has Tri-me K4 H3 only when the
gene is active and Di-me only when repressed. Another
methionine regulated gene, MET17, showed no Di-me K4 H3,
either when active or repressed, but Tri-me K4 H3 appeared
when the gene became active. The K4 methylation patterns
we observed at all the genes analysed here were not
restricted to promoter regions but were also observed
when probes in the coding region were used.

These results (summarized in Table 2) indicate that the
trimethyl state of K4 H3 is unambiguously linked to
active transcription. When genes are repressed, Tri-me K4
H3 is not observed. In contrast, Di-me K4 H3 is often
present in repressed genes but is also present on some
active genes. This is consistent with previously reported
findings at the chicken beta-globin locus, where Di-me K4
H3 is detected on large euchromatic regions encompassing
both active and inactive genes (Litt et al Science 293
2453-5 (2001)). Di-me K4 H3 may determine a
transcriptionally active state "poised" for stimulation.
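
The conclusion that the Tri-me state, but not the Di-me
state, tracks transcriptional activity can be checked
against the observations reported above for the regulated
genes. The sketch below encodes only those genes for which
both Di-me and Tri-me results are stated in the text (INO1,
MET16 and MET17); the tuple layout and variable names are
hypothetical and the sketch is not a reproduction of Table 2.

```python
# (gene, transcriptional state, Di-me K4 H3 detected, Tri-me K4 H3 detected)
observations = [
    ("INO1",  "repressed", True,  False),
    ("INO1",  "active",    True,  True),
    ("MET16", "repressed", True,  False),
    ("MET16", "active",    False, True),
    ("MET17", "repressed", False, False),
    ("MET17", "active",    False, True),
]

# Does a mark appear exactly when the gene is active?
tri_tracks_activity = all(
    tri == (state == "active") for _, state, _, tri in observations
)
di_tracks_activity = all(
    di == (state == "active") for _, state, di, _ in observations
)

print("Tri-me present exactly when active:", tri_tracks_activity)
print("Di-me present exactly when active:", di_tracks_activity)
```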

5 

These findings demonstrate that SET1 is an activator of
transcription and uncover a new level of regulation of K4
H3 function, by virtue of methylation status.

Methylation of histone H3 by SET polypeptides plays an
important role in the regulation of transcriptional
activity and the control of cellular proliferation.
Manipulation of the SET polypeptide histone
methyltransferase activity may therefore be used in the
treatment of conditions associated with cellular
proliferation, such as cancer.

Claims:



1. An assay method for an agent which modulates the
interaction of histone H3 with a SET polypeptide, the
method including:

(a) bringing into contact a SET polypeptide and a
histone H3 polypeptide in the presence of a test
compound; and,

(b) determining interaction between the histone H3
polypeptide and the SET polypeptide.

2. An assay method for an agent which modulates the
histone H3 methylase activity of a SET polypeptide, the
method including:

(a) bringing into contact a SET polypeptide and a test
compound; and,

(b) determining the histone H3 methylase activity of the
SET polypeptide.

3. An assay method according to claim 2 wherein the SET
polypeptide and the test compound are brought into
contact in the presence of a histone H3 polypeptide, and
the histone H3 methylase activity is determined by
determining the presence of one or more methyl groups at
a lysine residue of the histone H3 polypeptide.

4. An assay method according to claim 3 wherein said
lysine residue is selected from the group consisting of
lysine 4, lysine 27, lysine 36, lysine 37, lysine 42,
lysine 56, lysine 64, lysine 79, lysine 115, lysine 121,
lysine 122 and lysine 125.

5. An assay method according to claim 3 wherein said
lysine residue is lysine 4.

6. An assay method according to claim 5 wherein the SET
polypeptide is a SET1 polypeptide.

7. An assay method according to claim 6 wherein the di
and/or tri methylation of the Lys 4 residue by the SET1
polypeptide is determined.

8. An assay method according to claim 7 comprising 
determining the transcriptional activity of chromatin 

10 comprising said histone. 

9. An assay method according to any one of claims 2 to
4 wherein the SET polypeptide is a SET2 polypeptide and
methylation of a lysine residue of said histone H3
polypeptide selected from the group consisting of lysine
27, lysine 36, lysine 37, lysine 42, lysine 56, lysine
64, lysine 79, lysine 115, lysine 121, lysine 122 and
lysine 125 is determined.

10. A method according to any one of claims 1 to 9
comprising identifying and/or obtaining a test compound
which modulates the histone H3 methylase activity of a
SET polypeptide.

11. A method according to any one of claims 1 to 10
further comprising; testing for a biological function of
methylated histone H3.

12. A method according to claim 11 wherein said
biological function is transcriptional activation.

13. A method according to any one of claims 1 to 10
further comprising; testing for the absence of a
biological function of an unmethylated histone H3.

14. A method according to claim 13 wherein said
biological function is binding to NuRD complex.

15. A method according to any one of claims 2 to 14
comprising isolating and/or purifying the test compound.

16. A method according to claim 15 comprising
formulating said compound with a pharmaceutically
acceptable excipient.

17. A SET polypeptide fragment having histone H3
methylase activity.

18. A fragment according to claim 17 which is a SET1
polypeptide fragment which methylates lysine 4 of histone
H3.

19. A fragment according to claim 18 wherein said SET1
polypeptide is hSET-1P (human KIAA0339) or hSET-2P (human
KIAA1076).

20. A fragment according to claim 19 wherein said SET
polypeptide is a SET2 polypeptide which methylates a
lysine residue of the histone H3 core.

21. A fragment according to claim 20 wherein said SET2
polypeptide is hSET2-1, hSET2-2, hSET2-3 or WHSC1.

22. A method of producing a SET polypeptide comprising;
expressing a SET polypeptide from encoding nucleic acid,
and;
determining the methyltransferase activity of the
expressed SET polypeptide.

23. An agent which modulates SET polypeptide activity
obtained by a method according to any one of claims 1 to
16.

24. A pharmaceutical composition comprising an agent
according to claim 23.

25. A method of making a pharmaceutical composition
comprising admixing a substance according to claim 23
with a pharmaceutically acceptable excipient, vehicle or
carrier, and optionally other ingredients.

26. A method of treatment of a disorder of cellular
proliferation which includes administering to a patient
an agent according to claim 23.

27. Use of an agent according to claim 23 in the 
manufacture of a medicament for treatment of a 
proliferative disorder. 

28. An assay method for an agent which modulates the
interaction of histone H3 with a NuRD repressor,
the method including:

(a) bringing into contact one or more components of the
NuRD repressor complex and a histone H3 polypeptide in
the presence of a test compound; and,

(b) determining interaction between the histone H3
polypeptide and the said one or more components.

29. A method according to claim 28 wherein the histone
H3 polypeptide is unmethylated at histone H3.

30. A method of identifying transcriptionally active
chromatin comprising;
determining the presence or absence of a
trimethylated Lys 4 residue in a histone H3 polypeptide
of said chromatin.

31. A method according to claim 31 wherein said method
comprises; contacting said chromatin region with an
antibody which binds specifically to a histone H3
polypeptide which is trimethylated at Lys 4.



1: H3(1-16) Unmethylated        ARTKQTARKSTGGKAP
2: H3(1-16) K4+K9 Methylated    ARTKQTARKSTGGKAP
3: H3(1-16) K9 Methylated       ARTKQTARKSTGGKAP
4: H3(1-15) K4 Methylated       ARTKQTARKSTGGKAP
5: H4(1-14) Unmethylated